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Derivative Nucleic Acids and Uses Thereof 



Field of Invention 

This invention relates to the field of analyzing complex mixtures of nucleic acids for 
the presence of specific sequences and for the presence of specific sequence variations. 

10 Background 

Methods of the invention can be used to analyze complex mixtures of nucleic acids in 
small amounts of source material. Common applications are the quantification of messenger 
RNA levels for specific genes in an organism or tissue or the determination of the allele 

1 5 status of genetic polymorphisms in genomic DNA. Often there is a requirement to analyze 
many different nucleic acid sequences in a single sample, and it is often necessary to amplify 
the target nucleic acids or signal molecules to obtain detectable signals in an assay. 

DNA microarrays provide the ability to analyze many target sequences in a sample. 
However, conventional microarray analysis is limited by the cost and difficulty of preparing 

20 large numbers of target molecules from limited amounts of sample and by poor hybridization 
specificity. Multiplexed analysis - amplifying target sequences or target-specific probe 
molecules from many targets in a single sample - can minimize the cost of reagents and the 
consumption of precious samples. Methods with better sequence specificity than simple 
hybridization reactions - e.g., analytical methods based on activities of nucleic acid 

25 modifying enzymes - provide improved reliability and accuracy of the quantification of 
specific sequences and the detection of specific polymorphisms. 



Multiplex polymerase chain reactions (PCR) have been used to decrease the sample 
preparation for scoring genetic variation by hybridization on DNA microarrays. 
30 Unfortunately many of the target sequences fail to amplify optimally, especially when large 
multiplex factors (large numbers of target sequences amplified simultaneously) are used 
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(Wang et al., Science 280 :1077, 1998). Multiplex PCR has been performed with 5' 
extensions or "tails" on target-specific PCR primers which incorporate universal priming 
sequences into amplification products (Brownie et al, Nuclejc^cjg^^ mj . 
U.S. patent 5,858,989; Jeffreys et al; Favis et al, m^m^mo^U^l, 2000) ' ' 
5 Amplification is performed for a limited number of cycles with these tailed target-specific 
pnmers, and men further amplification is performed with high concentrations of universal 
primers with the same sequence as the 5' tails of the target-specific primers. 

Multiplex amplification with universal primers after appending 5' extensions on 
amphficafcon products with adapter primers has been used with strand displacement 
0 amplification (US Patent 5,422,252, Walker et al). Generic or universal primers have been 
used to amplify ligatable probes (US patent 5,876,924, Zhang et al, Thomas et al Arch 
MloLLabMedI23:1170, 1999). The use of generic primers for probe amplification is 
advantageous compared to convention target amplification with PCR, since primer binding 
and amplification is not subject to the variability of target sequences. The ligatable probes 
5 may be pan's of linear probes or a single circularizable probe. In the case of pairs of linear 
probes the generic primer sequences are incorporated into the 3' and 5' ends of the probes 
respectively (US patent 5,876,924, Zhang et al.). In the case of circularizable probes, generic 
primer sequences are incorporated into the linker region between the target-specific termini 
of the probe. Amplification of circularizable probes may be performed with a single generic 
' Pnmer ("rolling circle amplification", RCA) or with a pair of generic PCR primers (US 
patent 5,876,924, Zhang et al, Thomas et al., Arch Pathol I .ah iu<»h 123: 1 1 70 I9 99) 
Amplified circularizable probes are attractive alternatives to PCR for multiplexed analysis 
(Isaksson and Landegren, Curr Opin Biotechnol 10:1 1, 1999). Ligation reaction provide 
excellent sequence discrimination for scoring SNPs (Thomas et al., mU^^^kMed 
123: 1 1 70, 1999; Favis et al, Nature Biotechnolr^ v i g:56I 

Another type of reaction that provides excellent sequence discrimination and that also 
prov.des amplification of the probes is the Invader assay developed by the Third Wave 
Technologies Company (US patent 5,846,71 7; US patent 5,888,780; US patent 5 985 557- 
US patent 5,994,069; US patent 6,001,567). With this method pairs of probes are hybridized 
to nucleic add targets, and an endonuclease enzyme effects cyclic structure-dependent 
cleavage of one of the probes, if the probe matches the target sequence. A fragment of 
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arbitrary sequence with a ligatable 3' terminus ("flap") is cleaved from the probe by the 
enzyme. The reaction provides substantial amplification and high specificity for scoring of 
SNPs (Ryan et al., Mol Diagn 4 :135, 1999, Griffin et al., Proc Natl Acad Sci U S A 96 :6301, 
1 999) and has great potential for multiplexing the analysis of many targets in a single sample 
5 (Griffin et al., Proc Natl Acad Sci U S A 96 :630 1 , 1 999). 

Capture of target sequences on microarrays of nucleic acid probes is limited by less 
than optimal discrimination between related but non-identical sequences and by the 
secondary structure of target molecules. Analysis of single nucleotide polymorphisms (SNPs) 
by hybridization on oligonucleotide arrays is error prone and requires the use of highly 

10 redundant sets of probes (Wang et al., Science 280 : 1077, 1 998; Cargill et al., Nature Genetics 
22: 231, 1999). A method of enhanced oligonucleotide capture uses partially duplex probes 
and enzymatic reactions (ligation and/or primer extension) to accurately discriminate 
perfectly matched targets from those containing mismatches (Broude et al., Proc. Natl. Acad. 
Sci. 91 : 3072, 1994; US patent 5,503,980, Cantor, Gunderson et al., Genome Research 

15 8 : 1 1 42, 1 998). Analysis of the efficiency of hybridization of various nucleic acid sequences 
(Mir and Southern, Nature Biotechnology 17 :788,19991 has demonstrated that the 
hybridization is most efficient at single stranded regions near duplex stems, probably due to 
the presence of helical order and the absence of tertiary interactions. Ligation and polymerase 
reactions have been shown to provide excellent discrimination of perfectly and mismatched 

20 probes required for the scoring of SNPs (Thomas et al., Arch Pathol Lab Med 123 : 1 1 70, 
1 999; Favis et al, Nature Biotechnology 18 :561 , 2000; Pastinen, et al., Genome Research 
7:606, 1 997). Thus capture of nucleic acid targets on partially duplex probes coupled with 
enzymatic discrimination provides for very high specificity. The probes capture only terminal 
sequences on target molecules; internal target sequences that match the single stranded 

25 overhang sequences of the probes are not captured. 

Summary of the Invention 

In one aspect, the invention features, a method for multiplexed analysis of a plurality 
30 of target nucleic acid sequences in a sample. The method provides a derivative nucleic acid 
for each target sequence analyzed and present in the sample. A derivative nucleic acid is a 
nucleic acid which includes a capture tag sequence at one, or both, of its 3' and 5' termini. 
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The denvative sequence is produced only upon the hybridization of a probe or primer which 
mcludes the sequence tag as an internal fragment to the target sequence for which it is 
specie. The sequence tag is analyzed, e.g., by its ability to bind to a capture probe e g a 
capture probe bound to a insoluble substrate, e.g., a bead, or an ordered array of partially 
5 duplex capture probes. The presence of the derivative nucleic acid, and its characteristic 
terrmnal sequence tag, is diagnostic of the presence of a selected target sequence The 
method can evaluate, e.g., identify or quantitate, the presence of one, or preferably a plurality 
of specific nucleic acid sequences, or the presence of specific sequence variations. The 
method includes the steps of: 
1 0 providing, for each target nucleic acid sequence to be analyzed, at least one 

probe/primer molecule which probe/primer molecule includes a region of sequence 
substantially complementary to a sequence in the target nucleic acid sequence and a region 
that k not located at either terminus of the probe/primer and which includes a capture tag 
sequence; 

1 5 forming a reaction mixture which includes the probe/primer molecules and the target 

sequences under conditions such that, if a probe/primer molecule specific for a target 
sequence and that target sequence are both present, one or a plurality of derivative molecules 

sequence, is generated, thereby producing a derivative nucleic acid suitable for evaluation- 
optxonally, evaluating the presence of one or more sequence tags, e.g., by capturing 
the derivative nucleic acid molecules by hybridizing the tag sequences to complementary 
smgle stranded overhangs on partially duplex probes, which are preferably spatially 
separated, (e.g., on beads or on an ordered array) such that the tag sequences are contiguous 
w lth the double stranded regions of the partially duplex probes and thereby analyzing said 
captured derivative nucleic acid molecules. In preferred embodiments the captured derivative 
nuclei acids are further analyzed, e.g., SNP's or other sequence events are evaluated, e.g., by 
a further reaction, e.g., an extension reaction. 

In a particularly preferred embodiment the derivative nucleic acid is Iigated to a 
capture probe, e.g., to a partially duplex probe and then washed, preferably stringently 
washed, to achieve highly specific capture. 

In a preferred embodiment a partially duplex probes include a single molecule which 
» partially self-complementary and forms a hairpin structure with either a 3' or 5' overhang 
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and optionally which contains a chemical moiety that allows for the immobilization and 
spatial separation of the probe molecules. 

In a preferred embodiment a partially duplex probes include a pair of molecules 
which are bound by non-covalent means to form a structure with either a 3' or 5' overhang 
5 and optionally which contains a chemical moiety on one of the molecules that allows for the 
immobilization and spatial separation of the probe molecules. 

In a preferred embodiment the partially duplex probes consists of a pair of molecules 
which are bound by covalent means to form a structure with either a 3" or 5' overhang and 
which contains a chemical moiety on one of the molecules that allows for the immobilization 
1 0 and spatial separation of the probe molecules. 

In another aspect, the invention features, a method of providing a derivative nucleic 
acid having single-strand overhang suitable for analysis. The method includes: 



embodiment the universal primer sequence is different for each primer of a pair, but the same 
pair is used for all targets) (occasionally referred to herein as Uf); 



15 



providing a first and second primer, 

wherein said first primer includes, preferably in the order of 5' to 3', 

a first region which includes a universal primer sequence (in a preferred 



a second region which includes a capture tag sequence and a 
cleavage site, e.g., a site for cleavage by a restriction enzyme sequence 



20 



(occasionally referred to herein as R/T), and 



a third region which can which can hybridize to a first region on 
the target nucleic acid sequence (occasionally referred to herein as Xf), 
wherein said second primer includes, preferably in the order of 5' to 3 



25 



a first region which includes a universal primer sequence 
(occasionally referred to herein as Ur); 



30 



a second region which includes a capture tag sequence (the capture 
tag sequence on the second primer can be the same or different, and is 
preferably of different sequence, from that of the capture tag of the first primer) 
and a cleavage site, e.g., a site for cleavage by a restriction enzyme sequence 
(occasionally referred to herein as R/T2), and 



a third region which can which can hybridize to a second region on the 
target nucleic acid sequence (occasionally referred to herein as Xr), 
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forming a reaction mixture which includes the first and second primers and the target 
nucleic acid, and using the target as a template, extending the primers along the target nucleic 
acid, to produce an extended target strand, which preferably includes, in order, a universal 
5 primer sequence, a capture tag sequence, target sequence, a capture tag sequence, and a 
universal primer sequence; 

optionally amplifying, using the first and second primers, the extended target strand; 

» contacting an extended target strand with a universal primer which binds to the 

universal primer sequence (and preferably not to regions on the amplified target strand other 
than the universal primer sequence) and extending the universal primers along the extended 
target strand to synthesize a extended target strand which includes, in order, a universal 
primer sequence, a capture tag sequence, target sequence, a capture tag sequence, and a 
universal primer sequence; 

optionally amplifying, using universal primers, the extended target stranded species; 

cleaving at the cleavage site of one or both ends of a double stranded extended target 
molecule to provide a derivative nucleic acid which includes a double stranded molecule 
having an overhang which includes the capture tag sequence, preferably at one or both of the 
3' and 5'termini, 

thereby producing a target molecule having an overhang. 

Methods of the invention allow for multiplexed reactions, e.g., reactions in which two 
or more targets, and preferably as many as 10, 50, 100, 200, 500, 1000, or 5000, are analyzed. 
Targets can be on the same molecule or can be on different molecules. Thus, in a preferred 
embodiment the reaction mix includes a second target, and a third and a forth primer are 
included in the reaction mix, 



wherein the third primer includes, preferably in the order of 5' to 3', 
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a first region which includes a universal primer sequence (in a 
preferred embodiment the universal primer sequence is different for each 
primer pair, but the same pair is used for all targets), 

a second region which includes a capture tag sequence (which is 
5 preferably different from the capture tag sequence on one or both of the first 

and second primers) and a cleavage site, e.g., a cleavage site for cleavage by a 
restriction enzyme, and 

a third region which can hybridize to a first region on the second 

target, 

10 wherein the fourth primer includes, preferably in the order of 5' to 3', 

a first region which includes a universal primer sequence; 
a second region which includes a capture tag sequence (which is 
preferably different form the capture tag sequence on one or both of the first 
and second primer, and which can be the same or different, and is preferably 
1 5 of different sequence, from that of the capture tag of the third primer) and a 

cleavage site, e.g., a site for cleavage by a restriction enzyme, and 

a third region which can which can hybridize to a second region on the 
second target, 

forming a reaction mixture which includes the first, second, third and fourth primers 
20 and the two targets, and using the second target as a template, extending the third and fourth 
primers along the second target, to produce an extended second target strand, which 
preferably includes, in order, a universal primer sequence, a capture tag sequence, target 
sequence, a capture tag sequence, and a universal primer sequence (the extended second 
target will include a capture tag sequence which is different from that on the extended first 
25 target); 

optionally amplifying, using the third and fourth primers, the extended second target 
strand produced in the step above; 

contacting an extended second target strand with a universal primer which bind to the 
universal primer sequence (and preferably not to regions on the amplified second target 
30 strand other than the universal primer sequence) and extending the universal primers along 
the extended second target strand to synthesize a extended second target strand which 
includes, in order, a universal primer sequence, a capture tag sequence, target sequence, a 
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capture probe. The template dependent extension of the 3' end of the capture probe is 
diagnostic for capture of the derivative nucleic acid. However, there are cases (e.g., when the 
derivative nucleic acids is to be analyzed by primer extension using a special primer) that the 
3* end should be blocked so that no extension can occur. 
5 In a preferred embodiment a partially duplex probes include a single molecule which 

is partially self-complementary and forms a hairpin structure with either a 3' or 5' overhang 
and optionally which contains a chemical moiety that allows for the immobilization and 
spatial separation of the probe molecules. 

In a preferred embodiment a partially duplex probes include a pair of molecules 

10 which are bound by non-covalent means to form a structure with either a 3' or 5' overhang 
and optionally which contains a chemical moiety on one of the molecules that allows for the 
immobilization and spatial separation of the probe molecules. 

In a preferred embodiment a partially duplex probes consists of a pair of molecules 
which are bound by covalent means to form a structure with either a 3' or 5* overhang and 

1 5 which contains a chemical moiety on one of the molecules that allows for the immobilization 
and spatial separation of the probe molecules. 

In a preferred embodiment the array is a three-dimensional array, e.g., a gel array. 
In preferred embodiments hybridization to the array is detected by any of: 
fluorescence, a proximity based signal generating system, or by mass spectrophotometry, 

20 e.g., by MALDI-TOF mass spectrophotometry. 

In preferred embodiments the method includes one or more enzyme mediated 
reactions in which a nucleic acid used in the method, e.g., a capture probe or a derivative 
nucleic acid, is the substrate or template for the enzyme mediated reaction. This can increase 
the specificity of the evaluation. The enzyme mediated reaction can be: an extension 

25 reaction, e.g., a reaction catalyzed by a polymerase; a linking reaction, e.g., a ligation, e.g., a 
reaction catalyzed by a ligase; or a nucleic acid cleavage reaction, e.g., a cleavage catalyzed 
by a restriction enzyme, e.g., a Type lis enzyme. The derivative nucleic acid sequence which 
hybridizes with the capture probe can be the substrate in an enzyme mediated reaction, e.g., it 
can be ligated to a strand of the capture probe or it can be extended along a strand of the 

30 capture probe. Alternatively, the capture probe can be extended along the derivative nucleic 
acid sequence. Alternatively, a separate primer can be contacted with the derivative nucleic 
acid for the analysis. (Any of the extension reactors discussed herein can be performed with 
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more single nucleotide polymorphisms, in a sample. 
The method includes: 
providing a first and second probe, 

wherein said first probe includes, preferably in the order of 5' to 3', 

a first region which includes a universal primer sequence (referred to 
occasionally herein as Uf); 

a second region which includes a capture tag sequence and a cleavage 
site, e.g., a site for cleavage by a restriction enzyme (referred to occasionally 
herein as R/Tl), and 

a third region which can which can hybridize to a first region on the 
target nucleic acid (referred to occasionally herein as X5'), 
wherein said second probe/primer includes, preferably in the order of 3' to 5', 
a first region which can which can hybridize to a second region on the 
target nucleic acid (the first and second region of the target sequence can abut, 
or can be separated e.g., by 1 or up to 2, 3, 4, 5, 10 or 20 nucleotides, in the 
case where the regions, the first and second probe can be joined by ligation, in 
the case where they are separated by one or more nucleotides the first and 
second primer can be joined by polymerase directed synthesis and ligation) 
(referred to occasionally herein as X3'), 

a second region which includes a capture tag sequence (the capture tag 
sequence on the second probe can be the same or different, and is preferably 
of different sequence, from that of the capture tag of the first probe) and a 
cleavage site, e.g., a site for cleavage by a restriction enzyme (referred to 
occasionally herein as R/T2), and 

a third region which includes a universal primer sequence (referred to 
occasionally herein as Ur); 
forming a reaction mixture which includes the first and second probes and the target 
nucleic, under conditions wherein the first and second probes are joined, e.g., by ligation, if 
the target is of a first sequence and not joined if the target is of a second sequence, to produce 
a joined probe, which preferably includes, in order, a universal primer sequence, a capture tag 
sequence, target sequence, a capture tag sequence, and a universal primer sequence, 
preferably in the order 5' to 3 '; 
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they are separated by one or more nucleotides the third and fourth 
probe/primer can be joined by polymerase directed synthesis and ligation), 

a second region which includes a capture tag sequence (which is 
preferably different form the capture tag sequence on one or both of the first 
5 and second primer, and which can be the same or different, and is preferably 

of different sequence, from that of the capture tag of the third primer) and a 
cleavage site, e.g., a site for cleavage by a restriction enzyme, and 
a third region which includes a universal primer sequence; 
forming a reaction mixture which includes the first, second, third and fourth probe 
10 and the two targets under conditions wherein the third and fourth probe are joined, e.g., by 
ligation, if the second target is of a first sequence and not joined if the second target is of a 
second sequence, to produce a second joined probe/primer, which preferably includes, in 
order, a universal primer sequence, a capture tag sequence, target sequence, a capture tag 
sequence, and a universal primer sequence, preferably in the order 5' to 3'; 
1 5 optionally, contacting a second joined probe with a pair of universal primers which 

bind to the universal primer sequences (and preferably not to regions on the second joined 
probe other than the universal primer sequences) and extending the universal primers along 
the second joined probe strand (or its complement) to produce one or a plurality of double 
stranded molecules having, in order, a universal primer sequence, a capture tag sequence, 
20 second target sequence, a capture tag sequence, and a universal primer sequence; 

cleaving at the cleavage site of one or both ends of a double strand joined probe to 
provide a derivative nucleic acid which is a double stranded molecule having overhangs 
which include the capture tag sequence at one or both of the 3' and 5' termini, preferably at 
the 3' terminus, 

25 The method can further include analyzing the derivative nucleic acids of the method, 

e.g., by evaluating the capture tag sequence of an overhang (The derivative nucleic acids can 
also be analyzed by analysis of the target-specific regions of the amplicons). The sequence of 
the overhang(s) can provide information as to the sequence of the target nucleic acid(s). The 
analysis can be performed on a duplex molecule which has an overhang at one or both ends 

30 or on single strands. One or both single strands can be analyzed. In the case where both 
strands are analyzed it is preferable that the sequence tags on each strand be different from 
one another, allowing, e.g., independent analysis, e.g., on an array of capture probes. 
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In preferred embodiments hybridization to the array is detected by any of: 
fluorescence, a proximity based signal generating system, or by mass spectrophotometry, 
e.g., by MALDI-TOF mass spectrophotometry. 

In preferred embodiments the method includes one or more enzyme mediated 
reactions in which a nucleic acid used in the method, e.g„ a capture probe or a derivative 
nucleic acid, is the substrate or template for the enzyme mediated reaction. This can increase 
the specificity of the evaluation. The enzyme mediated reaction can be: an extension 
reaction, e.g., a reaction catalyzed by a polymerase; a linking reaction, e.g., a ligation, e.g., a 
reaction catalyzed by a ligase; or a nucleic acid cleavage reaction, e.g., a cleavage catalyzed 
by a restriction enzyme, e.g., a Type lis enzyme. The derivative nucleic acid sequence which 
hybridizes with the capture probe can be the substrate in an enzyme mediated reaction, e.g., it 
can be ligated to a strand of the capture probe or it can be extended along a strand of the 
capture probe. Alternatively, the capture probe can be extended along the derivative nucleic 
acid sequence. Alternatively, a separate primer can be contacted with the derivative nucleic 
acid for the analysis. (Any of the extension reactors discussed herein can be performed with 
labeled, or chain terminating, subunits.) The capture probe duplex can be the substrate for a 
cleavage reaction. These reactions can be used to increase specificity of the method or to 
otherwise aid in detection, e.g., by providing a signal. A wash following a reaction, e.g., a 
ligase reaction, can improve detection and/or specificity. 

In a preferred embodiment the method includes detecting a genetic event, e.g., a 
single nucleotide polymorphism, in a target or sample. 

Methods of the invention can be used with a wide variety of samples and targets. In 
preferred embodiments the target is: a DNA molecule: all or part of a known gene; wild type 
DNA; mutant DNA; a genomic fragment, particularly a human genomic fragment; a cDNA, 
particularly a human cDNA. In preferred embodiments the target a cDNA the synthesis of 
which was directed by: an RNA molecule: an RNA transcript; wild type RNA; mutant RNA, 
or a human RNA. In preferred embodiments the target sequence is: a human sequence; a 
non-human sequence, e.g., a mouse, rat, pig, primate. In preferred embodiments the method 
is performed: on a sample from a human subject; and a sample from a prenatal subject; as 
part of genetic counseling; to determine if the individual from which the target nucleic acid is 
taken should receive a drug or other treatment; to diagnose an individual for a disorder or for 
predisposition to a disorder; to stage a disease or disorder, to determine if a party should pay 
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from the capture tag on the first single stranded probe, and wherein the cleavage site and 
capture tag sequence are disposed such that cleavage results in a single-stranded overhang, at 
one or both of its 3' and 5' termini, preferably a 3' single stranded overhang, which includes 
at its terminus, the capture sequence tag, and 
5 a universal primer sequence, which is preferably, of the same sequence as the 

universal primer sequence on the first single stranded probe; 

contacting the first single stranded probe with the first target and the second single 
stranded probe with the second target under conditions which allow the circularization of the 
a single stranded circular probe if to be circularized if a target is present which is homologous 
10 to its terminal target binding regions; 

contacting the first and second probes with a universal primer under conditions 
which allow rolling circle amplification and produce double stranded amplification product; 

cleaving the double stranded amplification product at cleavage sites, e.g., with a 
restriction enzyme, e.g., a type II restriction enzyme, to provide cleaved product having a 
1 5 capture tag sequence at the terminus of a single stranded overhang. 

The method can further include analyzing the derivative nucleic acids of the method, 
e.g., by evaluating the capture tag sequence of an overhang (The derivative nucleic acids can 
also be analyzed by analysis of the target-specific regions of the amplicons). The sequence of 
the overhang(s) can provide information as to the sequence of the target nucleic acid(s). The 
20 analysis can be performed on a duplex molecule which has an overhang at one or both ends 
or on single strands. One or both single strands can be analyzed. In the case where both 
strands are analyzed it is preferable that the sequence tags on each strand be different from 
one another, allowing, e.g., independent analysis, e.g., on an array of capture probes. 

A preferred method of evaluating the derivative nucleic acid molecules having an 
25 overhang includes contacting the molecules with a capture probe, e.g., a capture probe bound 
to a insoluble substrate, e.g., a bead, or an ordered array of probes. E.g., the molecules can be 
hybridized to an array having a plurality of capture probes, wherein each of the capture 
probes is positionally distinguishable from the other capture probes of the plurality and has a 
unique variable region not repeated in another capture probe of the plurality. In a particularly 
30 preferred embodiment the derivative nucleic acid is ligated to a capture probe, e.g., to a 
partially duplex probe and then washed, preferably stringently washed, to achieve highly 
specific capture. The unique regions hybridize with the capture tag sequence of a single 
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strand molecule or the single strand overhang(s) of a double stranded molecule, thereby 
allowing evaluation, e.g., identification or quantitation, or further analysis, e.g., sequence 
analysis, by a variety of analytical methods, of a target molecule. A capture probe can be 
single or double stranded but preferably has a double stranded region and a single stranded 
> region. Preferably it has a 3' end capable of serving as a priming site for extension. 

Preferably the unique region is in a 3' or 5' overhang, preferably the 3' single strand overhang 
of the capture probe. The template dependent extension of the 3 ' end of the capture probe is 
diagnostic for capture of the derivative nucleic acid. However, there are cases (e.g., when the 
derivative nucleic acids is to be analyzed by primer extension using a special primer) that the 
• 3 ' end should be blocked so that no extension can occur. 

In a preferred embodiment the partially duplex probes include a single molecule 
which is partially self-complementary and forms a hairpin structure with either a 3' or 5' 
overhang and optionally which contains a chemical moiety that allows for the immobilization 
and spatial separation of the probe molecules. 

In a preferred embodiment the partially duplex probes include a pair of molecules 
which are bound by non-covalent means to form a structure with either a 3 ' or 5' overhang 
and optionally which contains a chemical moiety on one of the molecules that allows for the 
immobilization and spatial separation of the probe molecules. 

In a preferred embodiment the partially duplex probes consists of a pair of molecules 
which are bound by covalentmeans to form a structure with either a 3' or 5' overhang and 
which contains a chemical moiety on one of the molecules that allows for the immobilization 
and spatial separation of the probe molecules. 

In a preferred embodiment the array is a three-dimensional array, e.g., a gel array. 
In preferred embodiments hybridization to the array is detected by any of: 
fluorescence, a proximity based signal generating system, or by mass spectrophotometry, 
e.g., by MALDI-TOF mass spectrophotometry. 

In preferred embodiments the method includes one or more enzyme mediated 
reactions in which a nucleic acid used in the method, e.g., a capture probe or a derivative 
nucleic acid, is the substrate or template for the enzyme mediated reaction. This can increase 
the specificity of the evaluation. The enzyme mediated reaction can be: an extension 
reaction, e.g., a reaction catalyzed by a polymerase; a linking reaction, e.g., a ligation, e.g., a 
reaction catalyzed by a ligase; or a nucleic acid cleavage reaction, e.g., a cleavage catalyzed 
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by a restriction enzyme, e.g., a Type lis enzyme. The derivative nucleic acid sequence which 
hybridizes with the capture probe can be the substrate in an enzyme mediated reaction, e.g., it 
can be ligated to a strand of the capture probe or it can be extended along a strand of the 
capture probe. Alternatively, the capture probe can be extended along the derivative nucleic 
5 acid sequence. Alternatively, a separate primer can be contacted with the derivative nucleic 
acid for the analysis. (Any of the extension reactors discussed herein can be performed with 
labeled, or chain terminating, subunits.) The capture probe duplex can be the substrate for a 
cleavage reaction. These reactions can be used to increase specificity of the method or to 
otherwise aid in detection, e.g., by providing a signal. A wash following a reaction, e.g., a 

1 0 ligase reaction, can improve detection and/or specificity. 

In a preferred embodiment the derivative nucleic acid is allowed to hybridize to the 
array and the 3* end of the capture probe is extend across the region of the target sequence, 
e.g., a genomic nucleic acid having a genetic event, with one or more terminating base 
species, where if more than one is used each species has a unique distinguishable label e.g. 

1 5 label 1 for base A, label 2 for base T, label 3 for base G, and label 4 for base C; thereby 
analyzing the amplified sample sequence. 

In a preferred embodiment the method includes detecting a genetic event, e.g., a 
single nucleotide polymorphism, in a target or sample. 

In preferred embodiments, a genetic event is within a region which hybridizes to a 

20 probe, e.g., is 1, 2, 3, 4 or 5 base pairs from the end of a region which hybridizes to a probe, 
or is at or sufficiently close to the end of the a region which hybridizes to a probe that a 
mismatch would inhibit ligation or DNA polymerase-based extension. 

Methods of the invention can be used with a wide variety of samples and targets. In 
preferred embodiments the target is: a DNA molecule: all or part of a known gene; wild type 

25 DNA; mutant DNA; a genomic fragment, particularly a human genomic fragment; a cDNA, 
particularly a human cDNA. In preferred embodiments the target a cDNA the synthesis of 
which was directed by: an RNA molecule: an RNA transcript; wild type RNA; mutant RNA, 
or a human RNA. In preferred embodiments the target sequence is: a human sequence; a 
non-human sequence, e.g., a mouse, rat, pig, primate. In preferred embodiments the method 

30 is performed: on a sample from a human subject; and a sample from a prenatal subject; as 

part of genetic counseling; to determine if the individual from which the target nucleic acid is 
taken should receive a drug or other treatment; to diagnose an individual for a disorder or for 
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predisposition to a disorder; to stage a disease or disorder; to determine if a party should pay 
for a treatment, e.g., to determine if one party, e.g., an insurance company or government, 
should pay or reimburse a party which has paid for a treatment. 

In a preferred embodiment the single stranded linearized circular probe will have its 
termini joined only when there is perfect complementarity between the regions of the probe 
which hybridize to the target and the target. 

In a preferred embodiment more than two, e.g., and as many as 10, 50, 100, 200, 500, 
1000, or 5000 target sequences are present and a single strand linearized circular probe 
specific for each is contacted with the sample. 

In some embodiments a single nucleic acid molecule will include more than one 
target sequence, in others, target sequences are on separate molecules, e.g., separate 
restriction or shear fragments. 

In a preferred embodiment the method is used to detect a genetic event, e.g., a 
mutation or an SNP, in a target sequence. The nucleotide at complementary to the genetic 
event or to a nucleotide of a genetic event, can be degenerate, e.g., the method can include the 
use of a plurality of primers which differ from each other at the interrogation site, e.g., four 
primers, one each with a, g, c, and t, at the interrogation site. Each of the plurality of primers 
will havea different capture tag, thus the method allows identification or sequencing of the 
nucleotide of interest. 

(b) combining the circular template with an effective amount of a RCA primer, at 
least two types of nucleotide triphosphates and an effective amount of a polymerase enzyme 
to yield a product, e.g., a single-stranded oligonucleotide rnultimer complementary to the 
circular oligonucleotide template; and 

In preferred embodiments amplification reactions are performed isothermally. 

In preferred embodiments, analyzing a sample polynucleotide sequence includes, e.g., 
sequencing the polynucleotide sequence, e.g., by sequencing by hybridization or positional 
sequencing by hybridization, detecting the presence of, or identifying, a genetic event, e.g., a 
SNP, in a target nucleic acid, e.g., a DNA. 

In preferred embodiments, the genetic event is within 1, 2, 3, 4 or 5 base pairs from 
the end of the target region which hybridizes to the probe, or is sufficiently close to the end of 
the target region which hybridizes to the probe that a mismatch would inhibit DNA 
polymerase-based extension from a target/ primed circle. 
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In preferred embodiments, the target nucleic acid is amplified, e.g., by PCR, prior to 
contact with a circular template. 

Preferably, a circular template has about 15-1500 nucleotides, and more preferably 
about 24-500 nucleotides and most preferably about 30-150 nucleotides. 
5 The polymerase enzyme can be any that effects the synthesis of the multimer, e.g., 

any polymerase described in 5, 714, 320. Generally, the definitions provided for circular 
vectors and their amplification in 5, 714,320, apply to terms used herein, unless there is a 
conflict between the terms in which case the meaning provided herein controls. 5, 714, 320, 
and all other U.S. patents mentioned herein are incorporated by reference. 
10 In another aspect, the invention features a method of providing a nucleic acid with 

single strand overhang, preferably a double strand nucleic acid with single strand overhangs 
at one or both ends, suitable for multiplex analysis e.g., suitable for detecting one or more 
genetic events, e.g., one or more single nucleotide polymorphisms, in a sample. 

The method includes: 

1 5 (a) providing a target nucleic acid having a first and second region, wherein the 

two regions preferably overlap; 

providing an invader probe which is complementary to the first region of the 

target, 

providing a signal probe having, in the 5' to 3' direction, (optionally) a signal 
20 sequence, a capture tag sequence, and a region complementary to the second region of the 
target nucleic acid; 

(b) contacting the. target sequence with the invader probe and the signal probe, 
under conditions wherein the invader probe and an end, e.g., the 3' end of the signal probe are 
annealed to the target nucleic acid sequence so as to create a cleavage structure having a 

25 single-stranded arm which includes, optionally, the signal sequence, and the capture tag; 

(c) cleaving the first cleavage structure under conditions such that cleavage of the 
cleavage structure occurs at a site located within the signal probe in a manner dependent upon 
the annealing of the invader and signal probes on the target nucleic acid such that cleavage 
liberates the single-stranded arm of the cleavage structure to generate a derivative nucleic 

30 acid which has the capture tag sequence at a terminus, preferably its 3* terminus; 

(d) optionally allowing a subsequent, or a plurality of subsequent copies of the 
signal probe to anneal and be cleaved to produce an additional or a plurality of additional 
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derivative nucleic acids which have the capture tag sequence at a terminus, preferably the 3' 



thereby providing a derivative nucleic acid suitable for analysis. 

In a preferred embodiment, the signal probe can include a sequence 5' to the capture 
! tag which self hybridizes to form a hairpin structure, such that cleavage results in a structure 
having a hair pin with a single strand overhang. The capture tag is on the single strand 
overhangs which is preferably a 3' overhang. 

Methods of the invention allow for multiplexed reactions, e.g., reactions in which two 
or more targets, and preferably as many as 10, 50, 100, 200, 500, 1000, or 5000, are analyzed. 
Targets can be on the same molecule or can be on different molecules. In the examples 
below an embodiment wherein the targets are on different molecules is described, but the 
method can be used to analyze different regions of a single molecule. Thus, in a preferred 
embodiment the reaction mix includes a second target having a first and second region, and a 
second invader and signal probe, wherein 

the second invader probe is complementary to a first region of a second target, 
the second signal probe includes, in the 5* to 3* direction,(optionally) a signal 
sequence, a capture tag sequence (which is preferably different in sequence from the capture 
tag sequence on the first signal probe, and a region complementary to a second region of the 
second target nucleic acid; 

(b) contacting the second target sequence with the second invader probe and the 
second signal probe, under conditions wherein the second invader probe and an end, e.g., the 
3' end of the second signal probe are annealed to the second target nucleic acid sequence so 
as to create a second cleavage structure having a single-stranded arm which includes, 
optionally, the signal sequences, and the capture tag; 

(c) cleaving the second cleavage structure under conditions such that cleavage of 
the second cleavage structure occurs at a site located within the second signal probe in a 
manner dependent upon the annealing of the second invader and signal probes on the target 
nucleic acid such that cleavage liberates the single-stranded arm of the cleavage structure to 
generate a second derivative nucleic acid which has the capture tag sequence at a terminus, 
preferably its 3* terminus; 

(d) optionally allowing a subsequent, or a plurality of subsequent copies of the 
second signal probe to anneal and be cleaved to produce an additional or a plurality of 
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additional derivative nucleic acids which have the capture tag sequence at a terminus, 
preferably the 3* terminus. 

In a preferred embodiment, the signal probe can include a sequence 5' to the capture 
tag which self hybridizes to form a hairpin structure, such that cleavage results in a structure 
5 having a hair pin with a single strand overhang. The capture tag is on the single strand 
overhangs which is preferably a 3' overhang. 

The method can further include analyzing the derivative nucleic acids of the method, 
e.g., by evaluating the capture tag sequence of an overhang (The derivative nucleic acids can 
also be analyzed by analysis of the target-specific regions of the amplicons). The sequence of 

10 the overhang(s) can provide information as to the sequence of the target nucleic acid(s). The 
analysis can be performed on a duplex molecule which has an overhang at one or both ends 
or on single strands. One or both single strands can be analyzed. In the case where both 
strands are analyzed it is preferable that the sequence tags on each strand be different from 
one another, allowing, e.g., independent analysis, e.g., on an array of capture probes. 

15 A preferred method of evaluating the derivative nucleic acid molecules having an 

overhang includes contacting the molecules with a capture probe, e.g., a capture probe bound 
to a insoluble substrate, e.g., a bead, or an ordered array of probes. E.g., the molecules can be 
hybridized to an array having a plurality of capture probes, wherein each of the capture 
probes is positionally distinguishable from the other capture probes of the plurality and has a 

20 unique variable region not repeated in another capture probe of the plurality. In a particularly 
preferred embodiment the derivative nucleic acid is ligated to a capture probe, e.g., to a 
partially duplex probe and then washed, preferably stringently washed, to achieve highly 
specific capture. The unique regions hybridize with the capture tag sequence of a single 
strand molecule or the single strand overhang(s) of a double stranded molecule, thereby 

25 allowing evaluation, e.g., identification or quantitation, or further analysis, e.g., sequence 
analysis, by a variety of analytical methods, of a target molecule. A capture probe can be 
single or double stranded but preferably has a double stranded region and a single stranded 
region. Preferably it has a 3' end capable of serving as a priming site for extension. 
Preferably the unique region is in a 3' or 5' overhang, preferably the 3' single strand overhang 

30 of the capture probe. The template dependent extension of the 3' end of the capture probe 
can be diagnostic for capture of the derivative nucleic acid. However, there are cases (e.g., 
when the derivative nucleic acids is to be analyzed by primer extension using a special 
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primer) that the 3* end should be blocked so that no extension can occur. 

In a preferred embodiment a partially duplex probes include a single molecule which 
is partially self-complementary and forms a hairpin structure with either a 3' or 5' overhang 
and optionally which contains a chemical moiety that allows for the immobilization and 
5 spatial separation of the probe molecules. 

In a preferred embodiment a partially duplex probes include a pair of molecules 
which are bound by non-covalent means to form a structure with either a 3' or 5' overhang 
and optionally which contains a chemical moiety on one of the molecules that allows for the 
immobilization and spatial separation of the probe molecules. 
> In a preferred embodiment a partially duplex probes consists of a pair of molecules 

which are bound by covalent means to form a structure with either a 3' or 5' overhang and 
which contains a chemical moiety on one of the molecules that allows for the immobilization 
and spatial separation of the probe molecules. 

In a preferred embodiment the array is a three-dimensional array, e.g., a gel array. 
In preferred embodiments hybridization to the array is detected by any of: 
fluorescence, a proximity based signal generating system, or by mass spectrophotometry, 
e.g., by MALDI-TOF mass spectrophotometry. 

In preferred embodiments the method includes one or more enzyme mediated 
reactions in which a nucleic acid used in the method, e.g., a capture probe or a derivative 
nucleic acid, is the substrate or template for the enzyme mediated reaction. This can increase 
the specificity of the evaluation. The enzyme mediated reaction can be: an extension 
reaction, e.g., a reaction catalyzed by a polymerase; a linking reaction, e.g., a ligation, e.g., a 
reaction catalyzed by a Jigase; or a nucleic acid cleavage reaction, e.g., a cleavage catalyzed 
by a restriction enzyme, e.g., a Type Us enzyme. The derivative nucleic acid sequence which 
hybridizes with the capture probe can be the substrate in an enzyme mediated reaction, e.g., it 
can be ligated to a strand of the capture probe or it can be extended along a strand of the 
capture probe. Alternatively, the capture probe can be extended along the derivative nucleic 
acid sequence. Alternatively, a separate primer can be contacted with me derivative nucleic 
acid for the analysis. (Any of the extension reactors discussed herein can be performed with 
labeled, or chain terminating, suounirs.) The capture probe duplex can be the substrate for a 
cleavage reaction. These reactions can be used to increase specificity of the method or to 
otherwise aid in detection, e.g., by providing a signal. A wash following a reaction, e.g., a 
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ligase reaction, can improve detection and/or specificity. 

In a preferred embodiment the derivative nucleic acid is allowed to hybridize to the 
array and the 3' end of the capture probe is extend across the region of the target sequence, 
e.g., a genomic nucleic acid having a genetic event, with one or more terminating base 
5 species, where if more than one is used each species has a unique distinguishable label e.g. 
label 1 for base A, label 2 for base T, label 3 for base G, and label 4 for base C; thereby 
analyzing the amplified sample sequence. 

In a preferred embodiment the method includes detecting a genetic event, e.g., a 
single nucleotide polymorphism, in a target or sample. 

10 In preferred embodiments, a genetic event is within a region which hybridizes to a 

probe, e.g., is 1, 2, 3, 4 or 5 base pairs from the end of a region which hybridizes to a probe, 
or is at or sufficiendy close to the end of the a region which hybridizes to a probe that a 
mismatch would inhibit ligation or DNA polymerase-based extension. 

Methods of the invention can be used with a wide variety of samples and targets. In 

1 5 preferred embodiments the target is: a DNA molecule: all or part of a known gene; wild type 
DNA; mutant DNA; a genomic fragment, particularly a human genomic fragment; a cDNA, 
particularly a human cDNA. In preferred embodiments the target a cDNA the synthesis of 
which was directed by: an RNA molecule: an RNA transcript; wild type RNA; mutant RNA, 
or a human RNA. In preferred embodiments the target sequence is: a human sequence; a 

20 non-human sequence, e.g., a mouse, rat, pig, primate. In preferred embodiments the method 
is performed: on a sample from a human subject; and a sample from a prenatal subject; as 
part of genetic counseling; to determine if the individual from which the target nucleic acid is 
taken should receive a drug or other treatment; to diagnose an individual for a disorder or for 
predisposition to a disorder; to stage a disease or disorder, to determine if a party should pay 

25 for a treatment, e.g., to determine if one party, e.g., an insurance company or government, 
should pay or reimburse a party which has paid for a treatment 

The cleavage structure can be cleaved with, e.g., a cleavage agent, such as those 
described in U.S. Patent No. 5,871,91 1. By "cleavage agent" is meant a molecule such as a 
DNA polymerase (DNAP), a domain of a DNAP, or a synthetically created protein or 

30 peptide, capable of cleaving a cleavage structure at a specific site. These enzymes are 

sometimes referred to as "cleavases" or as "flap endonucleases". This activity is sometimes 
associated with polymerases, but the activity of course is always an endonuclease. A 
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preferable example of a cleavage agent is a 5' nuclease activity of DNAP, such as DNAPTaq 
DNAPTfl, DNAPTth and DNAPEcl. After exposing the cleavage structure to a cleavage 
agent, the structure and agent are incubated under conditions wherein cleavage can occur 
The derivative nucleic acid has a capture tag sequence, e.g., a sequence which is 
5 complementary to capture array probes. The sequences complementary to capture probes can 
be portioned within the second oligonucleotide such that they are at the 3' end of the third 
oligonucleotide upon deavage of the second oligonucleotide. Thus, products from a 
multiplexed assay can be captured at different positions on an array of capture probes 
In some embodiments, the sequence at the 5' end of the derivative nucleic acid 

0 mcludes an arbitrary sequence and can includes a label, e.g., a primary label, such as biotin 
When b.otin us used, a streptavidin labeled enzyme can be attached to detect the captured 
third oligonucleotide. 

In some embodiments, the 5' end of the derivative nucleic acid includes a sequence 
tag, which can encode information, e.g., a specific allele for a polymorphic DNA sequence 
5 The information can be detected using different reporter moieties, e.g., different colors of 
fluorophors. In some embodiments, the third oligonucleotide is detected using enzyme-based 
chemiluminescence or fluorogenic amplification. 

In some embodiments, the method is used to analyze a genetic even, e.g a SNP or a 
mutation. Tne genetic event can occur in the first region, the second region, or at the junction 

1 of the two regions. 

In some embodiments, a biallelic or multi-allelic SNP is analyzed. 
In some embodiments, the third oligonucleotides from a plurality of sites are 
analyzed. 

In some embodiments, the derivative nucleic acid is further amplified eg by an 
thermal method, prior to further analysis, e.g., prior to hybridization to the capture probe 
array. The amplification can include, e.g., rolling circle amplification (RCA). When RCA is 
used, the method can include: 

(a) annealing an effective amount of the derivative nucleic acid to a single-stranded 
circular template to yield an annealed circular template, wherein the single-stranded circular 
template comprises (i) at least one copy of a nucleotide sequence complementary to the 
sequence of the derivative nucleic acid and optionally, (ii) at least one nucleotide effective to 
produce a cleavage site in an oligonucleotide multimer; 



WO 01/06012 



27 



PCT/US00/19176 



(b) providing the primed circular template with effective amounts of a primer, at least 
two types of nucleotide triphosphates and a polymerase enzyme, to yield a single-stranded 
oligonucleotide multimer complementary to the circular oligonucleotide template, wherein 
the oligonucleotide multimer comprises multiple copies (amplified) of the sample sequence; 

.5 optionally, 

(c) cleaving the oligonucleotide multimer at the cleavage site to produce the cleaved 
amplified sample nucleic acid; and 

(d) hybridizing the cleaved sample nucleic acid to the array of capture probes. 

In some embodiments, the method is used to detect single nucleotide polymorphisms 
1 0 (SNPs). Two different probes can be encoded with different tag sequences. Multiple pairs of 
signal probes can be used simultaneously with the same sample to multiplex the analysis of 
many target sequences. The products of the reaction can then be analyzed on a capture probe 
array, e.g., a PSBH array. 

Methods of the invention allow for the capture of cleavage products from cleavase 
1 5 invader assays. Accordingly, in one aspect, the invention includes a method of analyzing a 
polynucleotide sequence, e.g., a specific target nucleic acid molecule. The method includes: 

(a) providing (i) a target nucleic acid having a first and second portion; (ii) a first 
oligonucleotide complementary to the first portion of the target nucleic acid; and (ii) a second 
oligonucleotide having a 5' and 3' end and a region complementary to the second portion of 

20 the target nucleic acid, as well as a region which is not complementary to the target nucleic 
acid; 

(b) mixing the target nucleic acid, first oligonucleotide, and second nucleotide under 
conditions in which the first oligonucleotide and an end, e.g., the 3' end of the second 
oligonucleotide are annealed to the target nucleic acid sequence so as to create a cleavage 

25 structure having a single-stranded arm; 

(c) cleaving the first cleavage structure under conditions such that cleavage of the 
cleavage structure occurs at a site located within the second oligonucleotide in a manner 
dependent upon the annealing of the first and second oligonucleotides on the target nucleic 
acid such that cleavage liberates the single-stranded arm of the cleavage structure to generate 

30 a third oligonucleotide; 

(d) providing an array of a plurality of capture probes, wherein each of the capture 
probes is positionally distinguishable from other capture probes of the plurality on the array 
and wherein each of the capture probes contains a region of unique sequence; and 
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(e) hybridizing the third oligonucleotide with the array of capture probes thereby 
analyzing the sample sequence. ' y 

The* cleavage structure can be cleaved with, e.g., a cleavage agent, such as those 

5 ™T b ? nU - s - Pat ^^ 
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and DNAPEcI. After exposmg the cleavage structure to a cleavage agent, the structure 2 
agent are incubated under conditions wherein cleavage can occur 

0 In some embodiments, the third oligonucleotide has an arbitrary sequence e ■ a 
sequence wh.chmcludes an arbitrary tag sequence complen.ntary tolpl a« s 
fences complement 

ft econdohgonucleo.de. Thus, products from a multiplexed assay can be captured at 
5 different positions on an array of capture probes. 

an a hi" ^ emb ° dimentS ' at * «* of the third oligonucleotide includes 

an arbUxary sequence and includes a label, e.g., a primary label, such as biotin When bH 

1 which J" SOm ^ mb ! dimentS ' Ae 5 ' «* of the third oligonucleotide includes a sequence tag 

fluorophors m some embodiments, the third oligonucleotide is detected using enzyme- 
based chemilurmnescence or fluorogenic amplification. 

In some embodiments, the method is used to analyze a genetic even e a a SNP nr 

In some embodiments, a biajjelic nr mum-allelic SNP is analyzed 
analyj," """ " Mam ^ *» * W ^.eodte fern . plonllity of sites , re 

m:rr™^r^^ 

W^^S^^cnvea^onntofaeairdo,^^,,,^ 
circular template to yield an annealed circular template, wherein the smgle-stranded^hcular 
^-^(Oatleastoneeopyofanoc^de^ueneecomplm .maryl^ 
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sequence of the third oligonucleotide and optionally, (ii) at least one nucleotide effective to 
produce a cleavage site in an oligonucleotide multimer; 

(b) providing the primed circular template with effective amounts of a primer, at least 
two types of nucleotide triphosphates and a polymerase enzyme, to yield a single-stranded 

5 oligonucleotide multimer complementary to the circular oligonucleotide template, wherein 
the oligonucleotide multimer comprises multiple copies (amplified) of the sample sequence; 
optionally, 

(c) cleaving the oligonucleotide multimer at the cleavage site to produce the cleaved 
amplified sample nucleic acid; and 

IQ hybridizing the cleaved sample nucleic acid to the array of capture probes. 

In some embodiments, the method is used to detect single nucleotide polymorphisms 
(SNPs). Two different probes can be encoded with different tag sequences. Multiple pairs of 
signal probes can be used simultaneously with the same sample to multiplex the analysis of 
many target sequences. The products of the reaction can then be analyzed on a capture probe 
1 5 array, e.g., a PSBH array. 

Methods of the invention also allow for capture of single stranded products form 
multiplexed PCR. Accordingly, in another aspect, the invention includes a method of 
analyzing a polynucleotide sequence. The method includes: 

providing a double-stranded nucleic acid sequence, wherein the 5' ends of one or both 
20 sequences have a region of unique sequence; 

(optionally) contacting the double-stranded nucleic acid sequence with an agent which 
promotes strand separating, e.g., a peptide-nucleic acid (PNA) having a sequence, e.g.,. a 
genetic sequence, complementary to a region near the end or ends of the double-stranded 
nucleic acid molecule, thereby exposing a region of single-stranded nucleic acid sequence 
25 containing the unique sequence; and 

hybridizing the nucleic acid sequence having an exposed single-stranded region to a 
capture array, thereby analyzing the nucleic acid sequence. 

In some embodiments, the hybridized nucleic acid sequence is ligated to the capture 
array. Preferably, ligation is followed by a washing step. 
30 In some embodiments, the method includes providing a pair of primers, e.g., PCR 

primers, having generic tags and peptide nucleic acid (PNA>binding sites and using the 
primers to amplify the nucleic acid sequence. PNA are discussed in detail in Kuhn et al. J . 
Mol. Evol. 286:1337-1345 (1999) and Bukanov et al., Proc. Nat Acad. Sci. (USA) 95:5516- 
20 (1998). 
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Methods of the invention can be used to capture unamplified genomic targets and 
analyze them in an array. Accordingly, in another aspect, the invention features a method of 
detecting a nucleic acid sequence. The method includes: 

providing a population of double-stranded molecules having a defined sequence 
5 adjacent to at least one of their ends, wherein the defined sequences have a first and second 
defined regions, separated by a unique sequence region; 

(optionally) opening the duplex at one end, e.g.,'by contacting the double-stranded 
molecules with a first and second peptide nucleic acid complementary to the first and second 
defined regions under conditions sufficient to cause the unique sequence region to become 
) single-stranded; and 

hybridizing the double-stranded molecules having the single-stranded unique 
sequence region to a capture probe array, thereby analyzing the polynucleotide sequence 

In some embodiments, the double-stranded nucleic acid sequences are generated by 
digestion with a restriction enzyme. 

In some embodiments, the single-stranded unique region is annealed with a primer 
havmg an arbitrary sequence tags at its 5' end. The annealed primer is then extended with a 

Polymerase and other reagents known in the art. If desired, the extended strand is 
further amplified using PCR or the RCA amplification methods described herein. 

In some embodiments, the products of the primer extension or amplification are 
hgated to the capture probe arrays after hybridization. Preferably, the fragments are washed 
after ligation. The ligated capture probes can be further analyzed, e.g., by in situ RCA 
In some embodiments, the capture probes have 5' overhangs. 

In some embodiments, the single-strand unique sequence region is hybridized with an 
oligonucleotide. 

In another aspect, the invention includes a method of thereby analyzing a nucleic acid 
sequence. The method includes: 

providing double-stranded fragments, e.g., genomic fragments, having at least one 
defined end, produced, e.g., by restriction digestion; 

denaturing an end of a fragment, e.g., by contact with a peptide nucleic acid to 
produce an open end; 

hybridizing a primer to the open end and extending it; 

hybridizing the extension produces to a capture probe array, thereby analyzing a 
nucleic acid sequence. 

In some embodiments, the primer has a generic sequence tag, e.g., at its 5' end, which 
can be hybridized to a capture probe, e.g., a PSBH array. 

In some embodiments, the nucleic acid sample is unamplified genomic DNA. 
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In some embodiments, the products of the primer extension or amplification are 
ligated to the capture probe arrays after hybridization. Preferably, the fragments are washed 
after ligation. The ligated capture probes can be further analyzed, e.g., by in situ RCA. 

In some embodiments, the capture probes have 5' overhangs. 
5 In some embodiments, the single-strand unique sequence region is hybridized with an 

oligonucleotide. 

The invention also features bifunctional oligonucleotide probes and methods of their 
use. Accordingly, in another aspect, the invention includes a method for analyzing a nucleic 
acid sequence. The method includes: 
I o providing a population of single-stranded nucleic acid molecules, e.g. , denatured 

genomic DNA; 

contacting the population of nucleic acid molecules with an oligonucleotide to form a 
partially duplex region on the single-stranded nucleic acid molecules, wherein the 
oligonucleotide has a first region complementary to a defined nucleic acid sequence and a 
1 5 second region containing a recognition sequence for a type IIS restriction enzyme, e.g., Fok I; 

cleaving the partially duplex region with the type IIS restriction enzyme, wherein the 
restriction enzyme cleaves in the first region of the oligonucleotide to form a oligonucleotide 
digestion product; and 

hybridizing the oligonucleotide digestion product to a capture probe array, thereby 
20 analyzing the nucleic acid sequence. 

In some embodiments, the hybridized product is ligated to a capture probe array. 

The invention also features hybrid generic/custom dual probe arrays and methods of 
their use. Accordingly, in another aspect, the invention features a method for analyzing a 
nucleic acid sequence. The method includes: 
25 providing an array comprising a plurality of capture probes having a single-stranded 

region and a double-stranded region, wherein preferably the plurality of capture probes 
includes a generic capture probe and a custom capture probe. 

hybridizing a sample nucleic acid to the array, wherein the hybridizing sample nucleic 
acid binds to a single-stranded region in a capture probe in the array so that the 5' terminus of 
30 the sample nucleic acid abuts the 3' terminus of the double-stranded region of the capture 
probe; and 

ligating the sample nucleic acid to the 3* terminus of the double-stranded region of the 
capture probe. 

In some embodiments, the sample nucleic acid is labeled, e.g., with biotin, a 
3 5 fiuorophore, or one or more radiolabeled nucleotides. 



BHS oans 32 



WO 01/06012 



32 



PCT/US00/19176 



In some embodiments, the 3* terminus of the sample nucleic acid is extended using a 
DNA polymerase and sequences in the single stranded region of the capture probe as the 



In some embodiments, the extended product is labeled, e.g., with biotin, a 
5 fluorophore, or one or more radiolabeled nucleotides. 

The invention also features creation of custom arrays by hybridization of probe pools 
Accordingly, in another aspect, the invention includes a method for forming a custom array. 
The method includes: 

providing an array of a plurality of foundation probes, wherein each of the foundation 
) probes is positional^ distinguishable from other foundation probes of the plurality on the 
array and wherein each of the foundation probes contains a region of unique sequence; 

contacting the foundation probe array with a plurality of custom oligonucleotides 
comprising a first and second sequence, wherein the first sequence is complementary to a 
defined sequence in a foundation probe in the foundation probe array and the second 
sequence is complementary to a target sequence, thereby forming a custom probe array. 

In some embodiments, the 5' end of the oligonucleotide abuts the 3' end of a 
nucleotide in the foundation probe array. 

In some embodiments, the oligonucleotide can be ligated to the foundation probe 

array. 

In some embodiments, a sample of nucleic acids is hybridized to the capture probe 
array and nucleic acids in the sample which hybridize to the second sequence in the 
oligonucleotide are identified. In some embodiments, the 3" terminus of the hybridizing 
sample nucleic acid is extended using a DNA polymerase and sequences in the capture probe 
array as a template. 

In some embodiments, the sample nucleic acids are labeled. 

The invention also features a kit for analyzing a sample of nucleic acid molecules. 
The array includes: 

a foundation probe array comprising foundation probes having a defined sequence; 

an oligonucleotide comprising a first and second sequence, wherein the first sequence 
is complementary to a defined sequence in a foundation probe in the foundation probe array 
and the second sequence is complementary to a target sequence. 

Methods of the invention can also be used to capture RCA products containing 
multiplex tags. Accordingly, in one aspect, the invention includes a method of analyzing a 
polynucleotide, e.g., detecting a genetic event, e.g., a single nucleotide polymorphism, in a 
sample. The method includes: 

providing a sample which includes a sample polynucleotide sequence to be analyzed; 
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(2) (a) annealing an effective amount of sample sequence to a single-stranded 
circular template to yield an annealed circular template, wherein the single-stranded circular 
template comprises (i) at least one copy of a nucleotide sequence complementary to the 
sequence of the sample sequence and optionally, (ii) at least one nucleotide effective to 

5 produce a cleavage site in an oligonucleotide multimer; 

(b) providing the primed circular template with effective amounts of a primer, at 
least two types of nucleotide triphosphates and a polymerase enzyme, to yield a single- 
stranded oligonucleotide multimer complementary to the circular oligonucleotide template, 
wherein the oligonucleotide multimer comprises multiple copies (amplified) of the sample 

10 sequence; optionally, 

(c) cleaving the oligonucleotide multimer at the cleavage site to produce the 
cleaved amplified sample nucleic acid; and 

(3) analyzing the sample sequence from (2) (b) or (c), e.g , by providing an array 
of a plurality of capture probes, wherein each of the capture probes is positionally 

1 5 distinguishable from other capture probes of the plurality on the array, and wherein each 

positionally distinguishable capture probe of the plurality includes a unique (i.e., not repeated 
in another capture probe) region; and hybridizing the amplified sample sequence with the 
array of capture probes, thereby analyzing the sample sequence. 

In preferred embodiments, the amplified sequence from step 2 of the method can be 

20 further amplified, e.g., amplified by rolling circle, e.g., prior to analysis under step 3. In such 
embodiments, the amplified sample nucleic acid from step 2, e.g., a cleaved amplified sample 
nucleic acid, can be amplified further. The second or other subsequent rolling circle 
amplification can use a circular oligonucleotide probe of the same or similar sequence that 
used as Step 2, or one of a different sequence. It is also possible that the circular 

25 oligonucleotide in a second or subsequent rolling circle amplification, can be, for example, 
closed or open circular template. 

In preferred embodiments, the circular oligonucleic template (of any step) is prepared 
by a process comprising the steps of: 

(a) hybridizing each end of a linear precursor oligonucleotide to a single positioning 

30 oligonucleotide, e.g., a sample sequence, having a 5' nucleotide sequence complementary to a 
portion of the sequence comprising the 3' end of the linear precursor oligonucleotide and a 3' 
nucleotide sequence complementary to a portion of the sequence comprising the 5' end of the 
linear precursor oligonucleotide, to yield an open oligonucleotide circle wherein the 5' end 
and the 3' end of the open circle are positioned so as to abut each other; and 

3 5 (b) joining the 5' end and the 3' end of the open oligonucleotide circle to yield a 

circular oligonucleotide template. Rolling circle amplification can be primed by the 
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positioning oligonucleotide, e.g., the target nucleic acid, or by another primer, in this or other 
methods disclosed herein. 

In preferred embodiments, analyzing a nucleic acid includes, e.g., sequencing the 
nucleic acid, e.g., by sequencing by hybridization or positional sequencing by hybridization 

* „ In , T ferred emfa ° dimentS ' S enetic ev «* within 1, 2, 3, 4 or 5 base pairs from 
the end of the target molecule, or is sufficiently close to the end of the target molecule that a 

U preferred embodiments the inhibition is at least 50, 75, 90 or 99%. 

In preferred embodiments, the target is amplified, e.g., by a isothermal or 
nomsothermal method, e.g., by PGR, prior to contact with a circular template 

In preferred embodiments the circular template includes a site for a type IIS 
restriction enzyme and the site is positioned, e.g., such that a type IIS restriction binding at 

I ?„ J3Cent regi ° n WWCh ^ thC SamplC S ^ ence or <*»™ *» the region 

which binds the sample sequence. & 

In a preferred embodiment a region of the circular template is complementary to a 
genetic event, e.g., a mutation or SNP, and hybridizes effectually to sample nucleic acid 
having the event and sample nucleic acid not having the event. 

specific endonuclease binding site, e.g., a type IIS restriction enzyme binding site, and the 
method includes: 

hybridizing the single stranded target nucleic acid with the capture probe array 
(prefenibly the region of an amplification product which corresponds to the genetic event 
hybridizes with the variable region of a capture probe); 

probe- ( ° Pti0nal,y) %3ting ^ Stranded **** nucleic acid to a strand of the capture 

cleaving the single stranded target nucleic acid/capture probe duplex with a non- 
specific endonuclease, to form a cleaved single stranded target nucleic acid/capture probe 
duplex such that a base corresponding to the genetic event is in the single stranded region 
tormed by the cleavage; 

extending along the single strand which contains the genetic event with one and 
preferably vvith 2, 3, or all 4 labeled chain terminating nucleotides, wherein if more than one 
labeled cham terminating nucleotide is used each of the chain terminators eg AorC are 
distinguishable, such that the incorporation of a chain terminator indicates the presence' of a 
genetic event 
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thereby detecting or identifying a genetic event in a target nucleic acid. 

In preferred embodiments the polynucleotide sequence is: a DNA molecule: all or part 
of a known gene; wild type DNA; mutant DNA; a genomic fragment, particularly a human 
genomic fragment; a cDNA, particularly a human cDNA. 
5 In preferred embodiments the polynucleotide sequence is: an RNA molecule: nucleic 

acids derived from RNA transcripts; wild type RNA; mutant RNA, particularly a human 
RNA. 

In preferred embodiments the polynucleotide sequence is: a human sequence; a non- 
human sequence, e.g., a mouse, rat, pig, primate. 
1 0 In preferred embodiments the method is performed: on a sample from a human 

subject; and a sample from a prenatal subject; as part of genetic counseling; to determine if 
the individual from which the target nucleic acid is taken should receive a drug or other 
treatment; to diagnose an individual for a disorder or for predisposition to a disorder, to stage 
a disease or disorder. 

1 5 In preferred embodiments the capture probes are single stranded probes in an array. 

In preferred embodiments the capture probes have a structure comprising a double 
stranded portion and a single stranded portion in an array. 

In preferred embodiments hybridization to the array is detected by mass 
spectrophotometry, e.g., by MALDI-TOF mass spectrophotometry. 
20 In preferred embodiments probes are selected for minimal cross hybridization with 

other probes. 

In preferred embodiments the amplified sample sequence has attached thereto a first 
member of a proximity detector pair and hybridization to the array allows the first member to 
be brought into proximity with a second member to provide a signal. 
25 In a preferred embodiment the amplified sample sequence which hybridizes to a 

capture probe, or the capture probe, is the substrate of or template for an enzyme mediated 
reactions. For example, after hybridization to the capture probe, the amplified sample 
sequence is ligated to the capture probe, or after hybridization it is extended along the capture 
probe. 

30 In preferred embodiments the method includes one or more enzyme mediated 

reactions in which a nucleic acid used in the method, e.g., an amplified sample sequence, a 
capture probe, a sequence to be analyzed, or a molecule which hybridizes thereto, is the 
substrate or template for the enzyme mediated reaction. The enzyme mediated reaction can 
be: an extension reaction, e.g., a reaction catalyzed by a polymerase; a linking reaction, e.g., a 

35 ligation, e.g., a reaction catalyzed by a ligase; or a nucleic acid cleavage reaction, e.g., a 
cleavage catalyzed by a restriction enzyme, e.g., a Type IIS enzyme. The amplified sample 
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sequence which hybridizes with the capture probe can be the subsu*. in an en™, „• * 
r»c*m,e.g,,i,ca„bc iigated toastiandofutecaptum probe or ,Ta 

5 ""7^^^-- CAeycfthec^i™^^,^^ 
5 be performed wrth labeled, or chaiu tenninaung, subuuits ) The caoture „„*„!, , T 

the m eu,odor to „d,e™ S eaidi„de l »cuou,e.g..byprovidi„ga S i g nal * 

Meftodssucha, those described in U.S. Paten, Nos. 5,503,980 or 5 631 ,34 bothof 

~ tomyMd ™^ teted ^^^--^sUugh,r,het 

In prefened embodiments, the method includes: 
providing an array having a plurality of capture probes wherein each n r,h „ 
a) position^ distinguishable ftom the otL capuue p^ Cult T " " 

^oncapaWeofbybrid^ngadjacentbadtegenedceventL Otaal lZZZ« 
genetic event to a capn.ro probe of the army, (preferably fc regio „ ^ ^ J^V 

nuctac actd havmg , genedc even, with „„ e or more terminating base ^ ^Lore 

^"^^^•.Pmbesandprtaers.armys.tmdofterreagenteordevices 
disclosed herein are also within the invention 

• fc '^™»«-t-ni»usoftbeprobe,whichtaWeaaseque„cewhich 
hybndrzes to , first region of a targe, nucleic acid sequence; 

a second region which inciudes a cleavage site, e.g.,'a site for cleavage by a restriction 
enzyme sequence, e.g., a type Ds enzyme and a capture tag sequence. 

a third region which includes a sequence complementary to a „„ ivma , prinier 
sequence; and 



WO 01/06012 



37 



PCT/US00/19176 



a fourth region, at the other terminus of the probe, which includes a sequence which 
hybridizes to a second region of a target sequence. 

The total length of the probe can range, e.g., between 10 to 500 nucleotides. In 
one example, the first region of the probe can range in length from between about 9 to 50, or 
between 10-15 nucleotides in length. The capture tag sequence within the second region of 
the primer can range in length between 2-100, preferably 4-20, more preferably 4-12 and 
most preferably 4-8 and 6-8 nucleotides in length. The cleavage site within the second region 
can range between 5-40 nucleotides in length. The third region of the sequence can range 
between 6-100, preferably 7-50, more preferably 9-30 and most preferably 10-25 and 12-20 
nucleotides in length. The fourth region of the primer can range in length from between 
about 9 to 50, or between 10-15 nucleotides in length. 

Capture tag sequences should be designed to minimize secondary structure and to 
promote efficient hybridization, e.g., to an array of capture probes. In preferred embodiments 
a capture tag is a sequence not found at a terminus, and more preferably not found in the 
target sequence. A sequence complementary to the universal primer should be designed such 
that the universal primer can hybridize to and direct the amplification of a target nucleic acid 
sequence. 

The invention also features, a plurality of rolling circle amplifying probes (a set of 
probes), wherein each probe of the plurality of probes includes a unique capture tag sequence. 
An integration site nucleotide can differ between pairs, e.g., a plurality can include pairs with 
2, 3, or all of a, g, c, t as an interrogation site nucleotide. 

The total length of the primer can range between 10 to 400 nucleotides. In one 
example, the first region of the probe can range in length from between about 9 to 30, or 
between 10-15 nucleotides in length. The capture tag sequence within the second region of 
the probe can range in length between 2-100, preferably 4-20, more preferably 4-12 and most 
preferably 4-8 and 6-8 nucleotides in length. The cleavage site can range between 5-40 
nucleotides in length. The third region of the sequence can range between 6-100, preferably 
7-50, more preferably 9-30 and most preferably 10-25 and 12-20 nucleotides in length. The 
fourth region of the probe can range in length from between about 9 to 50, or between 10-15 
nucleotides in length 

Each probe in the plurality should include a unique capture tag sequence so as to 
allow separate analysis. Thus, the capture tags will differ by at least 1 and preferably at at 
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i~ 2, 3 4, 5, 6, 10, or 20 nudeotides. ,„ pre fsrrc<i embodfaMa ^ ^ ^ 

tog in each of the plurality of probes will be sufficiently different that a sequence lag will not 

hybndtze, under assay conditions used, ,„ a sequence which is commentary to another 

"P^tografirese.inoraerwords.fit^wffinotcrossbybridize. 

can aiso be destgned to mining secondary strucrare, to pronto, efficient hybridization, e g 

ata^a^raorepraferabfynotfou^lrato^^^^^^^ 
araphfydte sense and andser^ s ^ ofato8Mcmhavethesanieordjffena] 
sequences. F s 

10 The probes can be used to den*, a genefic event e.g., a mutadon or an SNP in a 

^^-^-^.eraft.p^.Thep.ural^ofp^c^rafT.rrroraeach 
other a, the end terminus of fite probe. The prc*e, shouid include at their interrogation site 

""'^^^"^in^rragationsitoadegenera^nucta.idewhiehaDow.forligatton 
or polyraerase extension, only when hybridized to a specific targot 

invention features a pair of primers including: 

(a) a first primer which includes, preferably in the order of 5' to 3- 
20 a first region which includes a sequence complementary to a universal primer 

sequence; 

a second region which includes a cleavage site, e.g., . site for cleavage by , restriction 
enzyme sequence, e.g., a type lis enzyme andacapture tag sequence, wherein the capture 
sequence has preferably a different sequence man any other capture sequence- 

a th.rd region which includes a sequence which hybridizes to a first target sequence- 



and 



sequence. 



(b) a second primer which includes, preferably in the order of 5' to 3'- 
a first region which includes a sequence complementary to a universal primer 



a second region which includes a cleavage site, e.g., a site for cleavage by a restriction 
enzyme sequence, e.g., « type Gs enzyme and a capture tag sequence , wherein ^ ^ 
sequence has the same or different sequence than the capture sequence of the f rot primer and 
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a third region which includes a sequence which hybridizes to a second target 
sequence. 

The total length of the primer can range between 10 to 200 nucleotides. In one 
example, the first region of the primer can range in length from between about 9 to 30, or 
5 between 10-1 5 nucleotides in length. The capture tag sequence within the second region of 
the primer can range in length between 2-100, preferably 4-20, more preferably 4-12 and 
most preferably 4-8 and 6-8 nucleotides in length. The cleavage site can range between 5-40 
nucleotides in length. The third region of the sequence can range between 6-100, preferably 
7-50, more preferably 9-30 and most preferably 10-25 and 12-20 nucleotides in length. 

10 Each probe in the plurality should include a unique capture sequence tag so as to 

allow separate analysis. Thus, the capture tags will differ by at least 1 and preferably at least 
2, 3, 4, 5, 6, 10, or 20 nucleotides. In preferred embodiments the sequence of the capture tag 
in each of the plurality of probes will be sufficiently different that a sequence tag will not 
hybridize, under assay conditions used, to a sequence which is complementary to another 

1 5 capture tag in the set, in other words, they will not cross hybridize. Capture tag sequences 

can also be designed to minimize secondary structure, to promote efficient hybridization, e.g., 
to an array of capture probes. In preferred embodiments a capture tag is a sequence not found 
at a terminus, and more preferably not found in the target sequence. A set of probes which 
amplify the sense and antisense strand of a target can have the same or different capture tag 

20 sequences. 

In one embodiment, the target sequences can be different regions on the same 
molecule. In another embodiment, the target sequences can be on different molecules. 

The invention also features a plurality (or set) of pairs of primers, e.g., PCR primers, 
wherein each pair of the plurality includes a unique sequence tag. An integration site 
25 nucleotide can differ between pairs, e.g., a plurality can include pairs with 2, 3, or all of a, g, 
c, t as an interrogation site nucleotide. 

The total length of the primer can range, e.g., between 10 to 500 nucleotides. In one 
example, the first region of the primer can range in length from between about 9 to 30, or 
between 10-15 nucleotides in length. The capture tag sequence within the second region of 
30 the primer can range in length between 2-1 00, preferably 4-20, more preferably 4-12 and 

most preferably 4-8 and 6-8 nucleotides in length. The cleavage site can range between 5-40 
nucleotides in length. The third region of the sequence can range between 6-100, preferably 
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Ao. pn ne Bss e q nenoe i neaob P ai I „ f feep, U ra lif yo fp , tash o u , d ^ htoortqileso 

"'^^.'O.^nnsleofidn,^^ 

-^^^innnese^ofeer^.^^^^^ / 
^t»r,e. g .,fe m array„fe a p toreprobM . In prefemd 

setofpnobeswbiebanrplify fee sense and andsense strand of.^etcanlel 

^P- b -»^-cdfedefec,a g e»edeeven,e. g .,, muafiollorailSNPiina 
^ceven.eanbedegene.fein.feep^.^ ptaIi ^ fprobescM diffcr ^ " 

orperymeraseextenata, only when hybridized to aspeoific targe, 

He invention also featmes « pair of primers/probes, e.g., li 8 ,d„ n or PGR orim 

(a) a first primer in the order of 5 1 to 3'havmg- 

. firs, region which mcmdes a seouence compiementary to a tmiversa, primer 



ast K foro,eav. 8 eby, r es W edo„en Z ymese,„e™ e , e . g .,, typens5nzyme . 

adnrdmgionwhiehmclodesa^ce which hybfidizes ,„ a firs, fcrge, s^. 



and 

(b) a second primer in the order of 5" to 3'having: 
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a first region which can hybridize to a second region on the target nucleic acid 
sequence; 

a second region which includes a capture tag sequence, wherein the capture sequence 
has preferably a different sequence than any other capture sequence, and a cleavage site, e.g., 
5 a site for cleavage by a restriction enzyme sequence, and 

a third region which includes a sequence complementary to a universal primer 
sequence. 

The total length of the primer can range between 10 to 200 nucleotides. In one 
example, the first region of the primer can range in length from between about 9 to 30, or 

10 between 10-15 nucleotides in length. The capture tag sequence within the second region of 
the primer can range in length between 2-100, preferably 4-20, more preferably 4-12 and 
most preferably 4-8 and 6-8 nucleotides in length. The cleavage site can range between 5-40 
nucleotides in length. The third region of the sequence can range between 6-100, preferably 
7-50, more preferably 9-30 and most preferably 10-25 and 12-20 nucleotides in length. 

15 a capture tag sequence in each primer should each be unique so as to allow separate 

analysis. Thus, they will differ by at least 1 and preferably at least 2, 3, 4, 5, 6, 1 0, or 20. In 
preferred embodiments the sequence of the capture tag in each of the plurality of probes will 
be sufficiently different that a sequence tag will not hybridize, under assay conditions used, to 
a sequence which is complementary to another capture tag in the set, in other words, they will 

20 not cross hybridize. Capture tag sequences can also be designed to minimize secondary 

structure, to promote efficient hybridization, e.g., to an array of capture probes. In preferred 
embodiments a capture tag is a sequence not found at a terminus, and more preferably not 
found in the target sequence. A set of probes which amplify the sense and antisense strand of 
a target can have the same or different capture tag sequences. 

25 In one embodiment, the target sequences can be different regions on the same 

molecule. In another embodiment, the target sequences can be on different molecules. 

The invention also features a plurality of pairs of primers, e.g., ligation or PCR 
primers, which allow for the extension, e.g., amplification, of a set of target nucleotide 
sequences, wherein each of the pairs includes a unique sequence tag. An integration site 

30 nucleotide can differ between pairs, e.g., a plurality can include pairs with 2, 3, or all of a, g, 
c, t as an interrogation site nucleotide. 

The total length of the primer can range, e.g., between 10 to 500 nucleotides. In one 
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30 (b) a signal probe including, preferably in the order of 5' to 3 'including: 

optionally a signal sequence; 
a capture tag sequence; and 
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a region which can hybridize to a second region on the target nucleic acid sequence. 

The total length of the probe can range between 10 to 400 nucleotides. In one 
example, the capture tag sequence probe can range in length between 2-100, preferably 4-20, 
5 more preferably 4- 1 2 and most preferably 4-8 and 6-8 nucleotides in length. 

The capture tag sequences in the plurality of the probes should each be unique so as to 
allow separate analysis. Thus, they will differ by at least 1 and preferably at at least 2, 3, 4, 5, 
6, 10, or 20. In preferred embodiments the sequence of the capture tag in each of the 
plurality of probes will be sufficiently different that a sequence tag will not hybridize, under 
10 assay conditions used, to a sequence which is complementary to another capture tag in the 
set, in other words, they will not cross hybridize. Capture tag sequences can also be designed 
to minimize secondary structure, to promote efficient hybridization, e.g., to an array of 
capture probes. In preferred embodiments a capture tag is a sequence not found at a 
terminus, and more preferably not found in the target sequence. A set of probes which 
1 5 amplify the sense and antisense strand of a target can have the same or different capture tag 
sequences. In a pair of probes the capture tag sequences can differ or they can be identical. 

In one embodiment, the target sequences can be different regions on the same 
molecule. In another embodiment, the target sequences can be on different molecules. 
The invention also features a plurality of pairs of probes for the invader-directed 
20 cleavage, wherein each pair of the plurality includes a unique sequence tag. An integration 
site nucleotide can differ between pairs, e.g., a plurality can include pairs with 2, 3, or all of 
a, g, c, t as an interrogation site nucleotide. 

The total length of the probes can range between 10 to 400 nucleotides. The capture 
tag sequence can range in length between 2-100, preferably 4-20, more preferably 4-12 and 
25 most preferably 4-8 and 6-8 nucleotides in length. The cleavage site can range between 5-40 
nucleotides in length. The third region of the sequence can range between 6-100, preferably 
7-50, more preferably 9-30 and most preferably 10-25 and 12-20 nucleotides in length. 

Each probe of the plurality should include a unique capture tag sequence so as to 
allow separate analysis. Thus, capture tags will differ by at least 1 and preferably at at least 
30 2, 3, 4, 5, 6, 10, or 20 nucleotides. In preferred embodiments the sequence of the capture tag 
in each of the plurality of probes will be sufficiently different that a sequence tag will not 
hybridize, under assay conditions used, to a sequence which is complementary to another 



PCT/US00/19176 



.0 . anay ofcapbare probes. In preferred , ^ ag . a nM — 

^^^dmoropr^blyn,,.^^.^^^ A set of probes which 
amphfy the sense and amisense strand of a targe, can have «he sane or different capnne rag 
sequences. h a pair of probes me capnne tag sequences can differ or they can he identical 

In one embodiment, the targe, sequences can be different regions on the same 
molecute. in another embodiment the huge, sconces can be on different moiecoles 

denvanve nucletc acid sentences described herein. For exampie, the mvennon includes I 
denvanve nncieic acid sequence mcindmg one or more captnre rags and , targe, nucleic acid 

the end, of m. nncieic acid sequem*. ,„ on. example, me nncieic acid sequence is a donbie 
^^-■eetdemeludmg.SWernmtgwhichmchrdeaftecapmreag. Inanother 

exam P ,e,mennc,eieacidse,ne»ceiaado„b,eap™dedmol»ulei„el„di„ga5-overhan g 
wbtch includes me caphne rag. ,„ ye, another exampie, the nueieic acid ia , amgie at, Jed 
sequence including a capture probe at its 5' and 3' termini. 

A capture tag sequence is a sequence which allows identification of the probe or 
denvadvenucleic^ofwhichitiaapart A capture tag sequence ia preferably 2-10O more 

Pmbeaordenvativenucleicacidswhichmclud.capmreh.gaar.generallyusedinsete ie 
moretanoneiauaed.soutatmuWplexedan.lysiacanbeperformed. Tie aequence of ' " 
caph.re .gs on me moiecules of a setabonld each be unique so as to allow separate analysis 
TTms.theywdidtfferatatleastl and preferably at at least 2, 3. 4, 5, 6 10 or20 In 
preferred embodiments the sequence of the capture tag in each molecule of a se, will be 

-fficiendy different mataseqne^en.gwil, „„, hybndize, under assay comlidon, used to. 
sequence which is complemenhary m anofter capmra H g i„ a , ^ in other ' 

notcrosshybndize. Capmre tag sequences car, ahm be designed to minimize seconder 

«mbod,men K acapmretagis.se,ue„ce„o,fnu»da,atermi„ns,«„dmorep re fe ra b,y„„, 
found in the resequence. A setofprtners which amplify the senee and annsense snand 
of a rarge, can have ,he same or different captura rag sequences. 



WO 01/06012 



45 



PCT7US00/19176 



A universal primer is one which can hybridize to and direct the amplification of a 
plurality of different nucleic acids, e.g., a plurality of different derivative nucleic acids of a 
plurality of nucleic acids having different capture tags. 

A derivative nucleic acid is a nucleic acid which includes a capture tag sequence at 
5 one or both of its termini in a single strand overhang. The derivative sequence is produced 
only upon the hybridization of a probe which includes the sequence tag as an internal 
fragment hybridizes to the target sequence for which it is specific. 

The invention provides methods for the multiplexed analysis of complex mixtures of 
nucleic acids by the generation and capture of a plurality of species of derivative nucleic acid 

1 0 molecules, the generation of which is dependent on the presence of specific nucleic acid 

molecules in a sample. A plurality of derivative nucleic acid molecules is generated for each 
species of target nucleic acid molecule to be analyzed in the sample. The nucleic acid 
sequences at one terminal region of the derivative nucleic acid molecules are "arbitrary" tag 
sequences that are present initially as internal sequences in probes used to prepare the 

1 5 derivative nucleic acids. The derivative nucleic acid molecules are analyzed, e.g., captured 
and sorted by contiguous base stacking hybridization of their terminal tag regions to single- 
stranded overhangs of a plurality of spatially separated partially duplex probes. Hybridization 
of the tag sequences on the derivative nucleic acid molecules to the partially duplex probes is 
very specific due to the use of an optimized set of relatively short sequences in the single- 

20 stranded overhangs. The specificity may be enhanced by a requirement for ligation or 
polymerization reactions for successful capture. The invention is useful for sorting or 
demultiplexing the products of multiplexed amplification reactions or assays on microarrays 
or beads. 

Preferred embodiments feature tag sequences which are not found in the target nucleic 
25 acid sequences. This results in a more efficient, more specific assay, particularly when the 
tags in a reaction are chosen so as to minimize cross hybridization, i.e., hybridization of a 
first tag to a sequence complementary to a second tag. As the sequence tags need not and 
preferable are not able to hybridize to the tags under conditions under which the assay is 
performed, they can be tailored to provide efficient specific hybridization, e.g., to a capture 
30 array. 

The capture tags used in methods of the invention are initially supplied as internal 
sequences, i.e., they are flanked on both sides with other sequences. Only after a target 
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sequence specific reaction are tags found at the tenninus of a nucleic acid. The internal 
posraomng mak es it very difficult for un-reacted tag-continuing probes to hybridize to a 
capture probe. On the contrary, reacted molecules, which present the tag sequence at a 
te ~ybn^^ 
5 smgle strand overhang which is complementary with a tag sequence. 

Methods of the invention are useful for many app ,i cations quantification Qf 

the express levels of specific genes, detection of the allelic variants of multiple 
polymorphic sites within individuals, screening for genetic disease, pharmacogenomic 
-^oop^ 

0 ^dforensicanalysistoidenrifyhumanoranimalspeciesorindividuals ' ' 

Methods described herein provide for the multiplexed amplification and/or analysis of 
nuclei ac ld target sequences by the specific capture of molecules that are derived from probe 
modules ( 'target-specific probes") that interact with target molecules and that are encoded 
Wlth ^-ncesunr^ 
■^-thetarget-specificprobe^ 

probes w,th the nucleic acid target molecules, these tag sequences will be present at the 
termuu of molecules derived from the target-specific probes ("derivative nucleic acids"). 

The enhanced discrimination of partially duplex probes is exploited for the highly 
specie capture and sorting of the derivative nucleic acids. Target-specific probe molecules 
that have not interacted in specific ways with target sequences are not captured since the 
sequence tags are not present on their termini. Thus the microarrays of generic probes are 
used to effectively demultiplex multiplexed solution phase reactions that produce derivative 
nucleic acids. 

Partially duplex arrays can be used in two ways: 1) to capture and sort amplified 
target sequences resulting from multiplexed amplification reactions such as PGR or RCA 
and2)to capture and sort the reaction products of multiplexed assays such as Invader assays 
Denvatrve nucleic acid can be partitioned by specific hybridization and immobilized 
by nuance specific-recognition onto an array element containing oligonucleotide^) 
packaged into a device. Each array element can specifically capture cleaved fragments 
ansmg from a single initial recognition event representing the target nucleic acid in the 
onginal sample. The signal intensity detected at each element can then reflect the relative 
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abundance of the target molecule relative to other nucleic acid targets or alleles in the original 
sample. Thus, a multitude of array elements can capture a multitude of cleaved fragment 
species. 

Methods of the invention provide for multiplexing nucleic acid amplification and 
5 detection processes. This allows for the simultaneous analysis of a large number of nucleic 
acid fragments present in the sample. In the final detection step, a specific quantitative signal 
is detected for each allele or nucleic acid fragment present in the sample. Spatial separation 
of the amplification targets into distinct array elements eliminates competition of 
simultaneously amplifying targets and improves the fidelity and yield of the overall process. 
0 Further, the spatial separation provides a direct means of marking amplification targets that 
yield insufficient or no signals due to failed amplification reactions that potentially arise from 
failure of priming events or mispriming. These steps minimize generation of ambiguous data 
and facilitates troubleshooting attempts. 

Also described are a variety of detection methods for qualitative and quantitative 
1 5 determination of each nucleic acid or allele present in the sample. Detection methods can 
rely on, e.g., fluorescent signals, redox pairs, electronic detection, or enzymatic reactions. 

Devices for miniaturization and parallel analysis of nucleic acids in a sample can 
include, e.g., microplates, acrylamide gel pads, flow-through chips, and or other supports. 

Unless otherwise defined, all technical and scientific terms used herein have the same 
20 meaning as commonly understood by one of ordinary skill in the art to which this invention 
belongs. Although methods and materials similar or equivalent to those described herein can 
be used in the practice or testing of the present invention, suitable methods and materials are 
described below. All publications, patent applications, patents, and other references 
mentioned herein are incorporated by reference in their entirety. In case of conflict, the 
25 present specification, including definitions, will control. In addition, the materials, methods, 
and examples are illustrative only and not intended to be limiting. 

Other features and advantages of the invention will be apparent from the following 
detailed description, and from the claims. 
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Detailed Descrip tion 

Description of Figures 

Figure 1 is a diagram of the elements of primer adapters for multiplex PCR 
5 amplification with universal primer sequences and the capture and sorting of the derivative 
nucleic acids on partially duplex arrays. 

Figure 2 is an illustration of the protocol for multiplex PCR with two target-specific 
primers incorporating universal primer sequences, Type IIS restriction sites and generic 
capture tags, followed by capture of one or both target strands of the derivative nucleic acids. 
> Figure 3 is a diagram of the elements of 5' and 3' probe/primers that are ligatable to a 

specific nucleic acid sequence and that can be amplified by PCR with universal primers. 

Figure 4 is an illustration of the protocol for target-dependent ligation and PCR 
amplification of contiguous probes incorporating universal primer sequences, Type IIS 
restriction sites and generic capture tags, followed by capture of one or both strands of the 
derivative nucleic acids. 

Figure 5 is a diagram of the elements of circularizable probes containing universal 
primer sequences, Type IIS restriction sites and generic capture tags that can be used for 
target-dependent ligation and amplification, followed by capture of one or both strands of the 
derivative nucleic acids. 

Figure 6 is an outline of the Cleavase Invader assays (Third Wave Technologies) 
showing the elements of flap sequences that may be used for capture and detection on 
partially duplex probes. 

Partially Duplex Capture Probes 

Methods disclosed herein are performed with a plurality of partially duplex probes, 
each with a unique sequence in its region of single-stranded overhang. The overhangs may be 
3' or 5' overhangs, depended on the type of molecule that is to be captured. 

The partially duplex probes may be formed from two nucleic acid segments that are 
covalently linked and which form a partially self-complementary hairpin structure (see, e.g., 
US patent 5,770,365, hereby incorporated by reference). A chemical moiety such as an amine 
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group may be present on the partially duplex probe, preferably on the loop region of the 
hairpin, to effect covalent immobilization to a substrate. 

The partially duplex probes may also be formed from two nucleic acid segments that 
are bound by non-covalent interactions, e.g., by hydrogen bonding of complementary base 

5 pairs. A chemical moiety such as an amine group may be present somewhere on one of the 
nucleic acid segments to effect covalent immobilization to a substrate. It is preferred that the 
binding of the two nucleic acid segments be adequately stable so that they form stable 
complexes in solution, and hence may be utilized as a preformed reagent. Making the regions 
of complementary long enough to be stable at the temperatures and solution conditions in 

1 0 which they will be used will effect stable interactions between the segments. Incorporation of 
nucleic acid analogs such as peptide nucleic acids that enhance the stability of nucleic acid 
hybrids will also enhance the stability of the complexes. Attaching binding moieties to the 
segments can also be used to effect stable interactions between the segments. For example, 
biotin can be added to each of the segments, and the segments can be combined in 

1 5 approximately equimolar amounts and allowed to form hybrids in solution. A dimeric 

antibody to biotin can then be added to stabilize the hybrid. The antibody can also be used to 
immobilize the partially duplex probe. 

In one preferred embodiment the partially duplex probes are immobilized on 
positionally distinguishable elements of a microarray, which may be created on a non-porous 

20 substrate or on a three dimensional porous substrate. The three dimensional porous substrate 
may consist of a hydrophilic polymer matrix, such as derivatized polyacrylamide, to which 
the probes are covalently attached. 

In another preferred embodiment the partially duplex probes are immobilized on 
different beads that may be distinguished by an analytical means such the spectral properties 

25 of electromagnetic absorption or fluorescence emission. 

The single-stranded overhangs on the partially duplex probes may be from four the 
twenty bases long. A set of single stranded probe sequences is selected from all the possible 
sequences in the overhangs - e.g., 1024 for five base overhangs or 4096 for six base 
overhangs. Probe sequences are selected to provide highly specific and efficient capture with 

30 minimal cross-reaction between the probes. The sequences of the single-stranded probe 

overhang sequences are unrelated to target sequences in the samples to be analyzed. Rather, 
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the overhang sequences are complementary to the capture tag sequences that are initially 
present internally in probe molecules ("target-specific probes") that interact with the target 
nucleic acids in solution. As a result of specific interaction of the target-specific probes with 
the nucleic acid target molecules, these tag sequences will be present at the termini of 
molecules derived from the target-specific probes ("derivative nucleic acids"). 

Suitable restriction enzymes for use in the invention are listed in the table below. 



Enzyme 



ApaBI 

Bael 

Bbr7! 

Bpll 

Bsai 

BsmBI 

Bsp24! 

BstXI 

Bbvl 

BsmAI 

BsmFI 

Cjel 

CjePI 

Fokl 

HaelV 

Hgal 

Sth132l 

Stsl 



Type of 
overhang 



# bases in 
overhang 



Positional Arrays 

Positional arrays suitable for the present invention include high and low density arrays 
on a two dimensional or three dimen S1 onaI surface. Positional arrays include nucleic acid 
molecules, peptide nucleic acids or high affinity binding molecules of known sequence 
attached to predefined locations on a surface. Arrays of this nature are described in numerous 
patents which are incorporated herein by reference. These include, e.g., Cantor U S Patent 
No. 5,503,980; Southern, EP 0373 203 Bl; Southern, U.S. Patent No. 5,700,637 and Deugau 
U.S.PatentNo. 5,508,169. The density of the array can range from a low density format, ' 
e.g., a microliter plate, e.g., a 96- or 384- well microliter plate, to a high density format, e.g 
1 000 molecules/cm , as described in, e.g., Fodor, U.S. Patent No. 5,445,934. 

The surface on which the arrays are formed can be two dimensional e g glass 
plastic, polystyrene, or three dimensional, e.g. polymer gel pads, e.g. polyaoyiamide gel pads 
of a selected depth, width and height 
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In preferred embodiments, the target or probes bind to (and can be eluted from) the 
array at a single temperature. This can be effected by manipulating the length or 
concentration of the array or nucleic acid which hybridizes to it, by manipulating ionic 
strength or by providing modified bases. 



5 



Gel Pad Arrays 

Array-based method described herein can be practiced on gel pad arrays. Gel pads, 
including arrays of gel pads, can be prepared by a variety of methods, some of which are 
known in the art. Examples of these methods are provided in, e.g., Timofeev et al, Nucleic 
Acids Research (1996), Vol. 24, 3142-3148; Drobyshev et al, Gene (1997) 188: 45-52; 
Livshits etal, Biophysical Journal (1996) 71:2795-2801; Yershov etal., Proc. Natl. Acad. 
Sci. USA (1996) 93:4913-4918; Dubiley et al., Nucleic Acids Research (1997), Vol. 25, 
2259-2265; and U.S. Patent No. 5,552,270 by Khrapko et al. Each of the foregoing is 
incorporated herein by reference. Gel pad arrays are the preferred positional arrays for use in 
the methods described herein. 

In some embodiments, a sample which contains a target analyte, e.g., a 
polynucleotide, such as a sample which contains genomic DNA, is loaded into a gel pad. An 
array of gel pads on a first solid support can be employed to perform an analysis on a 
plurality of samples, or a plurality of probes to detect a plurality of characteristics, e.g., SNPs, 
of a sample or samples. The genomic DNA is preferably digested, e.g., with a restriction 
enzyme, to provide shorter fragments of DNA which can easily diffuse into the gel pad(s). 
The gel pad composition and/or the size of fragments can be selected to permit the target 
polynucleotides to diffuse into the gel pad, and/or to prevent larger pieces of, e.g., genomic 
DNA from diffusing into the gel pad. The volume of the gel pad(s) is preferably less than 
about 1 microliter, more preferably less than about 500, 100, 50, 10, 5, 1, .5, or 0.1 nanoliters 
per gel pad. Volumes in this range permit the diffusion of reactants and target to occur in a 
conveniently short time period (e.g., preferably less than 5, 2, 1, 0.5, or 0.1 minutes). After 
the sample polynucleotide has diffused into the gel pad, the remaining sample can be washed 
away. 

An "array" can be any pattern of spaced-apart gel pads disposed on a substrate. 
Arrays can be conveniently provided in a grid pattern, but other patterns can also be used. In 
preferred embodiments, a gel pad array includes at least about 10 gel pads, more preferably at 
least about 50, 100, 500, 1000, 5000, or 10000 gel pads. In some embodiments, the array is 
an array of gel pads of substantially equal size, thickness, density, and the like, e.g., to ensure 
that each gel pad behaves consistently when contacted with a test mixture. In certain 
embodiments, however, the pads of a gel pad array can differ from one another; e.g., a mixed 
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20 



gel pad array can be constructed which includes more than one size or type of gel pad e g 
gel pads made of different gel materials, or which entrap different species such as reagents'or 
polynucleotide probes. In certain preferred embodiments, gel pads in an array are less than 
about 1 mm m diameter (or along a side, e.g., in the case of square gel pads), more preferably 

lessen about 500 microns, still more preferably less than about 100, 75, 50, 25 10 5 or I 
micron in diameter. ' ' ' 

A gel pad can have any convenient dimension for use in a particular assay In 
preferred embodiments, a gel pad is thin enough, and porous enough, to permit rapid 
Affusxon of at least certain reaction components into the gel pad when a solution or 
suspense ts place din contact with the gel pad. For example, in one embodiment, a gel pad 
array for use in sequencing by hybridization permits polynucleotide fragments from a sample 
rmxture to drffuse (within a conveniently short time period) into the gel pads and hybridize to 
ohgonucleohde capture sequences disposed within the gel pads. In certain preferred 
embedments, a gel pad (e.g., in an array of gel pads) has a thickness of at least about 1 5 
10, 20, 30, 40, 50 or 100 microns. In certain preferred embodiments, a gel pad (e g in an 
array of gel pads) has a thickness of less than about I millimeter, 500 microns, 200 100 50 
40, 30, 20, 10, 5, or 1 microns. ' ' ' 

In preferred embodiments, a first gel pad (or each the first array of gel pads) includes 
afirstpnmer,e.g.,afir S tPCRprimer. The first primer is preferabiy complementary to at 
least a porhon of the sample polynucleotide (or to its complement). The first primer is 
preferably immobilized in the first gel pad to prevent migration of the primer out of the gel 
pad. The immohhzation can be permanent or reversible, and can be covalent or non- 
covalent. 
Example 1: 

Multiplex PCR with two target-speeific primers incorporating universal primer 
sequences, Type IIS restriction sites and generic capture tags, followed by capture of 
one or both target strands of the derivative nucleic acids 

Many pairs of target-specific PCR primers/adapters (Figure 1) are used for the 
multiplex amplification of many target sequences in a sample. Each of the PCR primers 
includes the following elements: 

1) Target-specific sequences, X, at the 3 > ends (Xf and Xr in Figure 2); 

2) Two universal primer sequences (Uf and Ur) at the 5' ends that are common to all 
the pairs of PCR primers used in an assay; 

3) The sequence for a Type ns restriction endonuclease (R); 
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4) One or two generic capture tag sequences (Tl and T2) at internal positions in the 
primers and at specific positions relative to the Type IIS restriction endonuclease 
recognition sites, such that a double stranded DNA molecule containing these 
sequences will he cleaved adjacent to the tag sequence leaving the tag sequence in 
5 a single-stranded overhang. 

The protocol is illustrated in Figure 2. PCR is performed in two phases with the 
greatest amplification accomplished in the second phase by universal primers complementary 
to Uf and Ur. (see e.g., Brownie et al, Nucleic Acids Research 25 :3235, 1997; U.S. patent 
5,858,989, Jeffreys et al; Favis et al, Nature Biotechnology 18 :561, 2000). 
1 0 The products are digested with a Type IIS restriction endonuclease to leave the tag 

sequences in single-stranded overhangs on the ends of the fragments, which products are 
called derivative nucleic acids. The generic tags are complementary to overhangs on partially 
duplex probes and hybridize to them. Ligation to the partially duplex probes followed by 
stringent washing results in the covalent capture of single-stranded DNA containing the 
1 5 sequence to be analyzed. Both strands of the target may be captured on separate array 

elements if Tl and T2 are different sequences. A polymorphism may be analyzed by primer 
extension, for example. Polymorphisms can be analyzed in both strands simultaneously on a 
set of positionally distinguishable partially duplex probes. 
Example 2: 

20 Target-dependent ligation and PCR amplification of contiguous probes 

incorporating universal primer sequences, Type IIS restriction sites and generic capture 
tags, followed by capture of one or both strands of the derivative nucleic acids 

Many pairs of target-specific PCR probe/primers (Figure 3) are hybridized and ligated 
to different target sites in the nucleic acid sample. Each of the probe/primers includes the 
25 following elements (Figure 3): 

1) Target-specific sequences at the 3' end (X3' and X 5', respectively) which are 
complementary to contiguous sequences of the targets and which can be 
enzymatically ligated if, and only if, the probe sequences match the target 
sequence (e.g., the position marked "SNP") at the termini; 
30 2) Two universal primer sequences (Uf and Ur) that are common to all the pairs of 

probes used in an assay; 
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3) The sequence for a Type IIS restriction endonuclease (R); 

4) A capture tag sequence (Tl or T2 on the 5' and 3 ' probe/primers, respectively) at 
internal positions and at specific positions relative to the Type IIS restriction 
endonuclease recognition site, such that a double stranded DNA molecule 

5 containing these sequences will be cleaved adjacent to the tag sequence leaving 

the tag sequence in a single-stranded overhang; 

5) An optional signal sequence situated between the restriction site and the target- 
specific sequence (not shown in Figure 3) that can encode information about the 
probe such as the identity of the nucleotide at the terminus of the probe. For the 
scoring of SNPs, one of four generic sequences would be present in the probes to 
encode one of the four possible bases. 

The protocol is illustrated in Figure 4. Ligated probe/primers are amplified by PCR 
using universal primer pairs complementary to Uf and Ur (common to all the circularizable 
probes used to analyze targets in a sample) to produce double stranded products (see e.g., 
Zhang, et aL, Gene 21 1 :277 (1998); US Patent 5,876,924). 

The resulting amplification products are digested with a Type IIS restriction enzyme 
which leaves the tag sequences (initially positioned internally in the circularizable probe) on 
single stranded overhangs on the products of the digestion. These products, called derivative 
nucleic acids, are then captured and sorted on partially duplex probes. Both strands of the 
target may be captured on separate array elements if Tl and T2 are different sequences. A 
polymorphism may be analyzed by primer extension, for example. Polymorphisms can be 
analyzed in both strands simultaneously on a set of positional^ distinguishable partially 
duplex probes. 

The presence of specific sequences in target nucleic acids in the sample may be 
deduced by the presence of derivative nucleic acids on specific partially duplex probes. Two 
alternative methods may be used (in this or other methods described herein) to analyze target 
sequences for specific nucleotides at polymorphic sites (SNPs): 

I) At least two different types of one of the probe/primers are prepared with different 
nucleotides at the site of ligation (complementary to the site of a SNP) and with 
different signal sequences that encode the identity of the nucleotide. Only the 
probe molecules that are complementary to the target sequence will be ligated and 
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amplified (Zhang, et al., Gene 211 :277 (1998); US Patent 5,876,924). All of the 
allele-specific probes for a SNP (i.e., two probes for a biallelic SNP) contain the 
same capture tag sequence, so all (both for a biallelic SNP) of the resulting 
derivative nucleic acids are captured by the same partially duplex probe. The 
allele(s) of the polymorphism is determined by detection the signal sequence with 
generic hybridization probes complementary to the signal sequences in the 
derivative nucleic acids. For a biallelic SNPs two generic hybridization probes 
labeled with distinguishable fluorophors will reveal the allele status of the SNP. 
2) Probe/primers are prepared with degenerate nucleotides at the ligation site 
(complementary to the site of a SNP). Only the probe molecules that are 
complementary to the target sequence will be ligated and amplified. The resulting 
derivative nucleic acids are captured on partially duplex probes, and the allele 
status of the target is determined by a variety of means, such as primer extension 
reactions. 
15 Example 3: 

Target-dependent ligation and amplification of circuiarizabie probes containing 
universal primer sequences, Type IIS restriction sites and generic capture tags, followed 
by capture of one or both strands of the derivative nucleic acids 

Many different circuiarizabie probes are hybridized and ligated to different target sites 
20 in the nucleic acid sample. Each of the circuiarizabie probes includes the following elements 
(Figure 5): 

1) Target-specific sequences at the 3 ' and 5' ends (X3' and X 5', respectively) which 
are complementary to contiguous sequences of the targets and which can be 
enzymatically ligated if, and only if, the probe sequences match the target 

25 sequence (e.g., the position marked "SNP") at the termini; 

2) Two universal primer sequences (Uf and Ur) that are common to all the 
circuiarizabie probes used in an assay; 

3) The sequence for a Type IIS restriction endonuclease (R); 

4) One or two capture tag sequences (Tl and T2) at internal positions and at specific 
30 positions relative to the Type IIS restriction endonuclease recognition sites, such 

that a double stranded DNA molecule containing these sequences will be cleaved 
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adjacent to the tag sequence leaving the tag sequence in a single-stranded 
overhang; 

5) An optional signal sequence (S) that can encode information about the probe 
such as the identity of the nucleotide at one of the termini of the probe. For the 
5 scoring of SNPs, one of four generic sequences would be present in the probes 

to encode one of the four possible bases. 

Ligated (circularized) probes are amplified using universal primers (common to all 
the cncularizable probes used to analyze targets in a sample) to produce double stranded 
products (see e.g., Thomas et aL. Arch Pathol Lab 123:1170 1999; Zhang et al Gene 
> 2U :277 (1998); US Patent 5,876,924). The priming sites are situated in the circularizab~ 
probe so that the amplification is ligation dependent (see e.g., Thomas et al., Arch Pathol Lab 
Med_123:l 170, 1999; Zhang, et al., Gene211:277 (1998); US Patent 5,876,924), and so that 
the target dependent sequence, the type II S restriction site, the capture tag sequence and the 
optional signal sequence are amplified. The direction in which the primers are extended is 
indicated by the arrows inTigure 5. 

The resulting multimer amplification products are digested with a Type IIS restriction 
enzyme which leaves the generic tag sequences (initially positioned internally in the 
circularise probe) on single stranded overhangs on the products of the digestion These 

products, called derivative nucleic acids, are then captured and sorted on partially duplex 

probes. 

The presence of specific sequences in target nucleic acids in the sample may be 
deduced by the presence of derivative nucleic acids on specific partially duplex probes Two 
alternative methods may be used to analyze target sequences for specific nucleotides at 
polymorphic sites (SNPs): 

1) Separate circularizable probes are prepared with different nucleotides at the site of 
ligation (complementary to the site of a SNP) and with different signal sequences 
that encode the identity of the nucleotide. Only the probe molecules that are 
complementary to the target sequence will be ligated and circularized. All of the 
allele-specirlc probes for a SNP (i.e., two probes for a biallelic SNP) contain the 
same capture tag sequence, so all (both for a biallelic SNP) of the resulting 
derivative nucleic acids are captured by the same partially duplex probe. The 
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allele(s) of the polymorphism is determined by detection the signal sequence with 
generic hybridization probes complementary to the signal sequences in the 
derivative nucleic acids. For a biallelic SNPs two generic hybridization probes 
labeled with distinguishable fluorophors will reveal the allele status of the SNP. 
5 2) Probes are prepared with degenerate nucleotides at the ligation site 

(complementary to the site of a SNP). Only the probe molecules that are 
complementary to the target sequence will be ligated and circularized. The 
resulting derivative nucleic acids are captured on partially duplex probes, and 
the allele status of the target is determined by a variety of means, such as 
1 o primer extension reactions. 



Example 4: 

Specific capture and sorting of flap products from multiplexed Cleavase Invader 
assays on partially duplex arrays 

1 5 Many different pairs of Invader probes are hybridized to different target sites in the nucleic 
acid sample. Each pair consists of an invader oligonucleotide which forms a relatively stable 
hybrid with the target nucleic acid and a signal probe which hybridizes transiently to the 
target at a position adjacent to the invader probe (see e.g., Ryan et al., Mol Diagn 4 : 135, 
1999, Griffin et al., Proc Natl Acad Sci U S A 96 :6301, 1999). Many signal probes cycle on 

20 and off the target sequence, and they are cleaved by an endonuclease if and only if the 

sequence of the signal probe matches the target at the overlap between the signal probe and 
the invader probe. The signal probe includes the following elements (Figure 6): 

1) A3' region with sequence complementary to the target; 

2) An arbitrary capture tag sequence that is initially positioned internally and that 
25 includes at least one target-specific nucleotide that will be present on the 3 ' 

terminus of the expected cleavage product; 

3) A 5' region which can have an arbitrary signal sequence or a label moiety for 
detection of the product. 
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The flap products (derivative oligonucleotides) resulting from a multiplexed invader 
assay are captured and sorted by different partially duplex probes. The label on the 5' end of 
the flap can encode information about the probe that was cleaved such as the identity of the 
nucleotide in the overlap between the invader probe and the signal probe and hence the allele 
of a polymorphism. For a bialldic SNPs two generic hybridization probes complementary to 
the generic signal sequences and labeled with distinguishable fluorophors will reveal the 
allele status of the SNP. Alternatively, the generic hybridization probes used to detect the 
captured flaps (derivative oligonucleotides) can serve as generic primer sequences for rolling 
circle amplification with preformed circle. With this method of detection great signal 
amplification can be achieved. 
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1 . A method for multiplexed analysis of a plurality of target nucleic acid sequences in 
a sample comprising the steps of: 

providing, for each target nucleic acid sequence to be analyzed, at least one 
probe/primer molecule which probe/primer molecule includes a region of sequence 
5 substantially complementary to a sequence in the target nucleic acid sequence and a region 
that is not located at either terminus of the probe/primer and which includes a capture tag 
sequence; 

forming a reaction mixture which includes the probe/primer molecules and the target 
sequences under conditions such that, if a probe/primer molecule specific for a target 
10 sequence and that target sequence are both present, one or a plurality of derivative molecules 
having a capture tag at one or both its 3' or 5' termini, of the probe specific for the target 
sequence, is generated, thereby producing a derivative nucleic acid suitable for evaluation; 

evaluating the presence of one or more capture sequence tags. 

15 2. The method of claim 1, wherein the derivative nucleic acid molecules are analyzed 

by hybridizing the tag sequences to capture probes which are spatially separated. 

3. The method of claim 2, wherein the capture probes are partially duplex probes 
with capture tag-complementary single stranded overhangs 

20 

4. The method of claim 2, wherein the capture tags are disposed on beads. 

5. The method of claim 2, wherein the capture tags are disposed on an ordered array. 

25 6. The method of claim 2, wherein the derivative nucleic acid is ligated to a capture 

probe and then washed. 

7. A method of providing a derivative nucleic acid having single-strand overhang 
suitable for analysis comprising: 
30 providing a first and second primer, 

wherein said first primer includes, in the order of 5' to 3 ', 

a first region which includes a universal primer sequence, 
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a second region which includes a capture tag sequence and a 
cleavage site, and 

a third region which can which can hybridize to a first region on 
the target nucleic acid sequence, 
5 wherein said second primer includes, in the order of 5' to 3', 

a first region which includes a universal primer sequence 
a second region which includes a capture tag sequence (the capture 

tag sequence on me secondprimer can be the same or different from thatofthe 
capture tag of the first primer) and a cleavage site, and 

a third region which can which can hybridize to a second region on the 
target nucleic acid sequence, 

pnmer sequence; universal 



pnmer sequence; »«»aj 
thereby producing a target molecule having an overhang. 
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8. The method of claim 7, wherein the method further includes multiplexed reactions 
and the reaction mix includes a second target, and a third and a forth primer are included in 
the reaction mix, 

5 wherein the third primer includes, in the order of 5 ' to 3 

a first region which includes a universal primer sequence, 
a second region which includes a capture tag sequence (which is 
different from the capture tag sequence on one or both of the first and second 
primers) and a cleavage site, e.g., a cleavage site for cleavage by a restriction 
10 enzyme, and 

a third region which can hybridize to a first region on the second 

target, 

wherein the fourth primer includes, in the order of 5' to 3', 

a first region which includes a universal primer sequence; 
15 a second region which includes a capture tag sequence and a cleavage 

site, e.g., a site for cleavage by a restriction en2yme, and 

a third region which can which can hybridize to a second region on the 

second target, 

forming a reaction mixture which includes the first, second, third and fourth primers 
20 and the two targets, and using the second target as a template, extending the third and fourth 
primers along the second target, to produce an extended second target strand, which includes, 
in order, a universal primer sequence, a capture tag sequence, target sequence, a capture tag 
sequence, and a universal primer sequence (the extended second target will include a capture 
tag sequence which is different from that on the extended first target); 
25 contacting an extended second target strand with a universal primer which bind to the 

universal primer sequence and extending the universal primers along the extended second 
target strand to synthesize a extended second target strand which includes, in order, a 
universal primer sequence, a capture tag sequence, target sequence, a capture tag sequence, 
and a universal primer sequence (wherein at least one capture tag is different from a capture 
30 tag on the first extended target); 

cleaving at the cleavage site of one or both ends of a double stranded second extended 
target molecule to provide a derivative nucleic acid which is a double stranded molecule 
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10. The method of claim 9, wherein the capture probes are partially duplex Dro b, 
w,th capture tag-complementary single stranded overhang, ? 

U. The method of claim 9, wherein the capture tags are disposed on bead, 

13. The method of claim 9, wherein the derivative nucleic ^ 

probe and then washed. S ^ to a capture 

14. A method of using ligatable probes to provide a nucleic acid having single strand 
overhangs suitable for analysis comprising: *ngle-strand 

providing a first and second probe, 

wherein said first probe include, in the order of 5' to 3', 

a first region which includes a universal primer sequence- 

a second region which includes a capture tag sequence and a cleavage 



site, and 



when™ said second proper jn the order of 3* to 5' 

a firs, region which can which can hybridise * a second region „„ , he 

target nucleic acid, 

a second region which incMes , ^ Bg 

^enceondtesecondprohe/prhnerc.nhenresa.eordrrferen^oLtor 
•ha caphrre ag of the fa ^ , 
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a third region which includes a universal primer sequence; 
forming a reaction mixture which includes the first and second primers and the target 
nucleic, under conditions wherein the first and second probes are joined if the target is of a 
first sequence and not joined if the target is of a second sequence, to produce a joined probe, 
which includes, in order, a universal primer sequence, a capture tag sequence, target 
sequence, a capture tag sequence, and a universal primer sequence, preferably in the order 5" 
to 3'; 

cleaving at the cleavage site of one or both ends of a double stranded extended target 
molecule to provide a derivative nucleic acid which is a double stranded molecule having 
overhangs which include the capture tag sequence at one or both of the 3' and 5' termini, 

thereby producing a target molecule having overhangs. 

15. The method of claim 14, wherein the method further includes multiplexed 
reactions and the reaction mix includes a second target, and a third and a forth probes are 
included in the reaction mix, 

wherein said third probe includes, in the order of 5' to 3', 

a first region which includes a universal primer sequence; 
a second region which includes a capture tag sequence (which is 
different form the capture tag sequence on one or both of the first and second 
probe) and a cleavage site, and 

a third region which can which can hybridize to a first region on the 
target nucleic acid, 

wherein said fourth probe includes, in the order of 5' to 3', 

a first region which can which can hybridize to a second region on the 

target nucleic acid, 

a second region which includes a capture tag sequence and a cleavage 

site, and 

a third region which includes a universal primer sequence; 
forming a reaction mixture which includes the first, second, third and fourth probe 
and the two targets under conditions wherein the third and fourth probe are joined if the 
second target is of a first sequence and not joined if the second target is of a second sequence, 
to produce a second joined probe, which preferably includes, in order, a universal primer 
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rapture probe and then washed. E««oroa 

mold™ 21 ' " f P " >V '* 8 * °" deiC * ** — -"hang suitable for 

multiplex analysis comprising: <"»eior 



sequences ; 



25 



linear probe includes, 



Wprovidingaaantp.e which inciude, oneor ap^ 0 f argeUucleio acid 



^ffget, and at i ts other terminus a second region which is complementary to a second region ona 
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a cleavage site 

a capture tag sequence, wherein the cleavage site and capture tag sequence are 
disposed such that cleavage results in a single-stranded overhang, at one or both of the 3' and 5' 
termini, and 
5 a universal primer sequence; 

(3) providing a second single stranded linear probe, wherein the second single- 
stranded probe includes, 

at one terminus, a first region which is complementary to a first region on a second 
target, and at its other terminus a second region which is complementary to a second region on a 
1 0 second target, wherein the first and second region on the first target can be directly or can be 
separated by one or more nucleotides, 

a cleavage site 

a capture tag sequence, which capture sequence can differ in sequence from the 
capture tag on the first single stranded probe, and wherein the cleavage site and capture tag 
1 5 sequence are disposed such that cleavage results in a single-stranded overhang, at one or both of 
its 3' and 5' termini, and 

a universal primer sequence, which is preferably, of the same sequence as the 
universal primer sequence on the first single stranded probe; 

contacting the first single stranded probe with the first target and the second single 
20 stranded probe with the second target under conditions which allow the circularization of the a 
single stranded circular probe if to be circularized if a target is present which is homologous to its 
terminal target binding regions; 

contacting the first and second probes with a universal primer under conditions which 
allow rolling circle amplification and produce double stranded amplification product; 
25 cleaving the double stranded amplification product at cleavage sites, e.g., with a 

restriction enzyme to provide cleaved product having a capture tag sequence at the terminus of a 
single stranded overhang. 

22. The method of claim 21 , wherein the derivative nucleic acid molecules are 
30 analyzed by hybridizing the tag sequences to capture probes which are spatially separated. 

23. The method of claim 22, wherein the capture probes are partially duplex probes 
with capture tag-complementary single stranded overhangs. 
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5 array. 



25. The^odofcUta 22, wherein cap^ ags ra fflsposed OT a „ ^ 



15 



26. The method of claim 22, wherein the derivative nucleic acid is ligated to a 
capture probe and then washed. 



27. A^odofprovidinganucleicacidwiths^estrandoverhangsuitablefor 
detecting one or more genetic events in a sample comprising: 

two regions preferably overlap; wwnme 
^ providing an invader probe which is complementary to the first region of the 

providing a signal probe having, in the 5' to 3' direction, signal sequence a 
capture tag sequence, andaregion -p.ementa.y to Ae second region of the targets 

. Q>) contacting the target sequence with the invader probe and the signal probe 

annealed to the target nucleic acid sequence so as to create a cleavage s^re hlng 
single-stranded arm which includes the capture tag; 

(0 cleaving the first cleavage structure under conditions such that cleavage of the 

theannea n g ofthemvader and signal probes on the tar g et n ucleic acid such J cleavage 
liberates!^ 

acid wh 1C h has the capture tag sequence at a termmus; 
thereby providing a derivative nucleic acid suitable for analysis. 
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28. The method of claim 27, wherein the method further includes multiplexed 
reactions and the reaction mix includes a second target, and a third and a forth probes are 
included m the reaction mix, wherein, 

the second invader probe is complementary to a first region of a second target, 
5 the second signal probe includes, in the 5' to 3' direction, signal sequence, a capture 

tag sequence (which is different in sequence from the capture tag sequence on the first signal 
probe), and a region complementary to a second region of the second target nucleic acid; 

(b) contacting the second target sequence with the second invader probe and the 
second signal probe, under conditions wherein the second invader probe and an end, e.g., the 

10 3' end of the second signal probe are annealed to the second target nucleic acid sequence so 
as to create a second cleavage structure having a single-stranded arm which includes the 
capture tag; 

(c) cleaving the second cleavage structure under conditions such that cleavage of 
the second cleavage structure occurs at a site located within the second signal probe in a 

1 5 manner dependent upon the annealing of the second invader and signal probes on the target 
nucleic acid such that cleavage liberates the single-stranded arm of the cleavage structure to 
generate a second derivative nucleic acid which has the capture tag sequence at a terminus; 

29. The method of claim 27, wherein the derivative nucleic acid molecules are 

20 analyzed by hybridizing the tag sequences to capture probes which are spatially separated. 

30. The method of claim 29, wherein the capture probes are partially duplex probes 
with capture tag-complementary single stranded overhangs. 

25 31. The method of claim 29, wherein the capture tags are disposed on beads. 

32. The method of claim 29, wherein the capture tags are disposed on an ordered 

array. 

30 33. The method of claim 29, wherein the derivative nucleic acid is Iigated to a 

capture probe and then washed. 
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34. A plurality of primers as described in claim I, 7, 14, 21, or 27. 

35. A plurality of derivative nucleic acids as described in claim 1, 7, 14, 2 1 , or 27. 
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